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1142 News at a glance 
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1144 ‘The game has changed.’ 

Al triumphs at protein folding 

In milestone, software predictions 
finally match structures calculated from 
experimental data By R. F. Service 


1145 Why were salmon dying? 
The answer washed off the road 
Common tire chemical implicated in 
coho salmon kills By E. Stokstad 
REPORT BY Z. TIAN ET AL. 
10.1126/SCIENCE.ABD6951 


1146 Scientists fear no-deal Brexit as 
deadline looms 

U.K. researchers in limbo, weeks from 
crashing out of EU trade and travel rules 
By C. O'Grady 


1147 How cats get their stripes and spots 
Newly uncovered mechanism may pattern 
the coats of other mammals, too 

By E. Pennisi 


1148 Federal hospital data system falters 
at tracking pandemic 

HHS Protect data, which influence how 
pandemic supplies and support are allocated, 
conflict with other data sources 

By C. Piller 


1150 New challenges emerge for planned 
human challenge trials 

Scientists debate risks and benefits 

of intentionally exposing volunteers to 
dangerous virus By W. Cornwall 
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1151 Amid the cheering, some vaccines 
face questions 

A candidate from AstraZeneca and 

the University of Oxford has puzzling efficacy 
data By J. Cohen 


FEATURES 


1152 Saving sanctuaries 

Critics charge that sanctuaries for retired 
research chimpanzees are failing their 
animals. Can a new tool help? By D. Grimm 


INSIGHTS 


BOOKS ET AL. 
1156 Fodder for future scientists 


PERSPECTIVES 


1162 Quantum-limited sound attenuation 
Resonantly interacting atoms confined by 
lasers have implications for neutron stars 

By T. Schaefer 

REPORT p. 1222 





Celebrity chimpanzee Hercules is making new 
friends at Project Chimps. 
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1163 Enzyme formation by immune 
receptors 

Upon pathogen recognition, some plant 
immune receptors assemble into active 
enzyme complexes By L. Tian and X. Li 
RESEARCH ARTICLES pp. 1184 & 1185 


1165 Miniaturization of robots 

that fly on beetles’ wings 

The shock-absorbing wings of the 
rhinoceros beetle battle in-flight collisions 
By J. Sun 

REPORT p. 1214 


1166 Probing the dark side of the exciton 
Photoemitted electrons reveal large- 
momentum (“dark”) excitons in monolayer 
WSe, By M.X. Na and Z. Ye 

REPORT p. 1199 


1167 Constraints on selfish 

behavior in plants 

Plants overproduce roots to secure resources 
nearby but avoid costly trips to neighbors’ 
patches By M. Semchenko 

REPORT p. 1197 


1168 Stimulating the brain to restore 
vision 

High-definition brain prostheses are 
developed for treating blindness 

By M. S. Beauchamp and D. Yoshor 
RESEARCH ARTICLE p. 1191 


1170 Mario J. Molina (1943-2020) 
Visionary environmental chemist 
By K. A. Prather 


POLICY FORUM 


1171 Fossil electricity retirement 
deadlines for a just transition 

A 2035 deadline for decarbonizing 

U.S. electricity would strand only about 
15% of fossil capacity-years 

By E. Grubert 
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1174 Evidence-based hunting policy 
needed in Slovakia 
By M. Kutal and M. Dula 


1174 Computational social science: 
On measurement 
By A. X. Wu et al. 


1175 Chinese sturgeon needs urgent 
rescue 
By X. Zhou et al. 


1175 Errata 
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REVIEW 


1180 Microbiota 

The role of the microbiota in human 
genetic adaptation 

T. A. Suzuki and R. E. Ley 


REVIEW SUMMARY; FOR FULL TEXT: 
DX.DOI.ORG/10.1126/SCIENCE.AAZ6827 
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1181 Coronavirus 

Comparative host-coronavirus protein 
interaction networks reveal pan-viral disease 
mechanisms D. E. Gordon et al. 


RESEARCH ARTICLE SUMMARY; FOR FULL TEXT: 
DX.DOI.ORG/10.1126/SCIENCE.ABE9403 





1182 Innate immunity 

Enteroviral 3C protease activates the human 
NLRP1 inflammasome in airway epithelia 

K. S. Robinson et al. 


RESEARCH ARTICLE SUMMARY; FOR FULL TEXT: 
DX.DOI.ORG/10.1126/SCIENCE.AAY2002 


1183 Evolution 

Selection enhances protein evolvability 
by increasing mutational robustness and 
foldability J. Zheng et al. 


RESEARCH ARTICLE SUMMARY; FOR FULL TEXT: 
DX.DOI.ORG/10.1126/SCIENCE.ABB5962 


Plant science 


1184 Direct pathogen-induced assembly of 
an NLR immune receptor complex to 


form a holoenzyme S. Ma et al. 
RESEARCH ARTICLE SUMMARY; 
FOR FULL TEXT: 


DX.DOI.ORG/10.1126/SCIENCE.ABE3069 


1185 Structure of the activated ROQ1 
resistosome directly recognizing the 


pathogen effector XopQ R. Martin et al. 
RESEARCH ARTICLE SUMMARY; 

FOR FULL TEXT: 
DX.DOI.LORG/10.1126/SCIENCE.ABD9993 
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1186 Developmental biology 
The N-glycome regulates the endothelial-to- 
hematopoietic transition D. M. Kasper et al. 


1191 Neuroscience 
Shape perception via a high-channel-count 
neuroprosthesis in monkey visual cortex 


X. Chen et al. 
PERSPECTIVE p. 1168; PODCAST 
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1197 Plant science 

The exploitative segregation of plant roots 
C. Cabal et al. 

PERSPECTIVE p. 1167 


1199 Physics 

Directly visualizing the momentum-forbidden 
dark excitons and their dynamics in 
atomically thin semiconductors 

J. Madéo et al. 

PERSPECTIVE p. 1166 


1204 Spectroscopy 

Super-resolution lightwave tomography of 
electronic bands in quantum materials 

M. Borsch et al. 


1208 Coronavirus 

De novo design of potent and resilient 
hACE2 decoys to neutralize SARS-CoV-2 
T. W. Linsky et al. 









1214 Biomechanics 
Mechanisms of 
collision recovery 

in flying beetles and 
flapping-wing robots 
H. V. Phan and H. C. Park 
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1219 Tropical forest 

Long-term collapse in fruit availability 
threatens Central African forest megafauna 
E.R. Bush et al. 


1222 Quantum gases 
Universal sound diffusion in 

a strongly interacting Fermi gas 
P B. Patel et al. 
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1227 Coronavirus 

Robust neutralizing antibodies to 
SARS-CoV-2 infection persist for months 
A. Wajnberg et al. 
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Temperature limits to deep subseafloor life 
in the Nankai Trough subduction zone 

V. B. Heuer et al. 
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Low-income students lose ground 
By Christopher Avery et al. 
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Root systems of two different plants, stained 
with red and blue dye for fresh cut flowers. 

Root dyeing provided experimental validation 
of theoretical predictions regarding plants’ 
response to competition. 
Neighbor plants segregate 
their root systems in the 
Soil space but develop 
more roots locally when 

a competitor is close than 
when growing alone. See 
pages 1167 and 1197. 
Photo: Mikel Ponce 
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EDITORIAL 


Low-income students lose ground 


ncome inequality in college attendance and gradu- 
ation in the United States was troublingly large be- 
fore the pandemic. Without an aggressive infusion 
of federal support for schools and students, these 
already sizable gaps will likely widen into chasms. 
Historically, a poor child in the United States has 
had a 10% chance of eventually earning a college de- 
gree; for children from well-off families, its over 50%. 
The pandemic is widening these differences, thereby in- 
creasing poverty, reducing social mobility, and stunting 
economic growth. Data from the National Student Clear- 
inghouse show that first-time enrollment at colleges and 
universities in the United States decreased by 13% from 
fall 2019 to fall 2020. At community colleges, the tradi- 
tional gateway for disadvantaged students, freshman en- 
rollment plunged 19%. Drops were 
largest among Black (18.7%), His- 
panic (19.9%), and Native Ameri- 
can (23.2%) students. 

Low-income college students 
have historically been half as likely 
to graduate as their wealthier 
classmates. The pandemic is likely 
to worsen those odds. Effective 
remote learning requires a broad- 
band connection, laptop, and a 
quiet, private space. Low-income 
students are the least likely to have 
this critical set of resources. 

The hit to college enrollment 
may be even greater in fall 2021. Freshmen who entered 
college this past fall had only the tail end of their ad- 
missions cycle disrupted; the entire process has been 
turned upside down for next year’s freshmen. Low-in- 
come and first-generation students depend on teachers, 
counselors, and coaches to get them through the bu- 
reaucratic and academic hurdles of college applications. 
Upper-income students typically have a family member 
who is a college graduate to play this role. 

All schools are challenged by remote instruction, but 
those serving low-income students are also struggling 
with connecting their students to the internet and deliv- 
ering school meals. There is now little time to spare for 
the formal and informal college counseling that schools 
traditionally provide. A leading indicator of the weaken- 
ing support is a national drop in financial aid applica- 
tions: FAFSA (Free Application for Federal Student Aid) 
filings are down by 16% among high school seniors, with 
the largest decreases among poor students. 

The pandemic is not just hampering the process of 
applying for college, but also profoundly affecting stu- 





“Rising tuition and 
shrinking supports 


will further... 
intensify education 
inequality.” 








dent learning. Remote learning is difficult for most stu- 
dents, but financially secure families can afford to hire 
tutors, buy broadband connections, and provide com- 
puters for their children. In poorer families, there may 
not even be an adult at home to supervise children’s 
schooling: Low-paid service jobs can rarely be com- 
pleted remotely. A high school or college student from a 
low-income family may be watching over siblings, with 
all of them sharing a single electronic device to connect 
to the internet and coursework. 

Public schools and colleges need more resources 
to provide high-quality remote learning, as well as to 
prepare schools for safe, in-person learning. Yet states 
are slashing education budgets. Public colleges (which 
serve the majority of American college students) are 
still reeling from funding cuts of 
the Great Recession of 2007-2009, 
and if state support continues to 
plummet, they will be forced to 
raise tuition and cut critical ser- 
vices. Rising tuition and shrink- 
ing supports will further reduce 
college attendance and intensify 
education inequality. 

Public schools, from preschool 
through college, need a financial 
rescue. But the states that tradi- 
tionally fund these schools are fac- 
ing enormous drops in tax revenue 
and cannot answer the call. The 
good news is that there are several sensible proposals 
to boost school spending with federal funds. The dollars 
need to flow, and quickly. 

We cannot ask college students to borrow their way 
out of this pandemic, nor to work more, given the col- 
lapse of the labor market. Financial aid for college stu- 
dents must adjust to the economic devastation caused 
by the pandemic. The standard response of the aid 
system to a sharp downturn in a family’s income is to 
demand supplementary paperwork. This protocol is de- 
signed for unusual cases, but right now the modal fam- 
ily is worse off than a year ago. The aid process should 
be streamlined and simplified to quickly get money out 
the door. For example, students who have already quali- 
fied for public assistance (such as food stamps through 
the Supplemental Nutrition Assistance Program or 
school meals) could automatically get grant aid. 

Spending on education is an investment in the nation’s 
future. An aggressive federal response is critical for get- 
ting students and schools through this perilous time. 

-Christopher Avery, Susan Dynarski, Sarah Turner 
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Ramon Lugo, director of the Florida Space Institute at the University of Central Florida, 
which manages the 57-year-old Arecibo Observatory, after learning on 1 December that the already 
crippled radio telescope was destroyed when its 900-ton instrument platform collapsed. 


Mohsen Fakhrizadeh’s 
body was carried through the 
«- streets of Tehran, Iran. 


Killing of Iran nuclear scientist dims diplomatic hopes 


ast week’s assassination of a prominent scientist who 
led Iran’s clandestine nuclear weapons R&D program 
could chill prospects for reinvigorating a 2015 agree- 
ment impeding Iran’s path to an atomic bomb in 
exchange for economic concessions. Physicist Mohsen 
Fakhrizadeh—often referred to as Iran’s J. Robert 
Oppenheimer, who led the Manhattan Project in the United 
States during World War II—was slain in an attack on a high- 
way outside of Tehran. His death follows a July explosion at 
an advanced centrifuge facility at Iran’s Natanz nuclear site, 


Nature sets open-access fee 


PUBLISHING | The elite Nature family of 

33 research journals, including the flagship 
Nature, announced on 24 November it will 
begin to charge authors €9500 to publish an 
article immediately free to read. The fee is 
thought to be the highest for any journal for 
open access. The Nature Research publish- 
ing group says the amount is necessary to 
cover the costs of the full-time editors and 
others who produce its primary research 
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journals. The new policy is meant to satisfy 
open-access mandates by European funders 
seeking to eliminate subscription paywalls in 
order to speed the flow of scientific informa- 
tion. Nature Research intends to eventually 
convert its research journals to solely open 
access. The group also unveiled a pilot under 
which it will charge lower fees to determine 
whether a manuscript is worth consideration 
for one of six journals other than Nature, 
then peer review it, with an additional 
charge if the paper is ultimately accepted. 


Published by AAAS 


which Iran labeled an act of sabotage. Neither of the blows, 
which Iran blamed on Israel, is likely to deal more than a 
tactical setback to Iran’s nuclear ambitions, analysts say. Ali 
Akbar Salehi, president of the Atomic Energy Organization of 
Iran, says the country intends to carry on Fakhrizadeh’s work. 
But Iran’s actions in response to the assassination may de- 
termine the fate of the Iran nuclear deal, a multilateral pact 
that has gradually unraveled since President Donald Trump 
withdrew in May 2018. U.S. President-elect Joe Biden has said 
he hopes to revive it. 


Integrity survey stumbles 


RESEARCH INTEGRITY | The world’s larg- 
est multidisciplinary survey on research 
integrity is falling short of its goals after 
two-thirds of invited institutions declined to 
collaborate. That left researchers leading the 
Dutch National Survey on Research Integrity 
to scrape many email addresses and solicit 
responses on their own. The survey will 
close next week, but the team has gathered 
responses from less than 15% of 40,000 
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targeted participants: all academics working 
in the Netherlands. Ten out of the nation’s 

15 universities declined to participate, with 
some university presidents saying the survey 
was too focused on bad behaviors, such as 
data falsification. Survey leaders had assured 
universities that the study would not gener- 
ate an institutional ranking of misbehavior. 


Ocean conservation gets boost 


MARINE PLANNING | In a show of unity, 

14 countries this week pledged to begin to 
manage all their national waters sustainably 
by 2025. The countries—including Australia, 
Indonesia, and Chile—control nearly 

40 million square kilometers of ocean. The 
pledges lack specific targets, but instead 
offer a vision of ocean economies that 
protect marine resources and share benefits 
equitably. Led by Norway and Palau, the 
High Level Panel for a Sustainable Ocean 
Economy in 2018 commissioned some 

250 scientists from 45 countries to analyze 
potential solutions, such as improving fisher- 
ies management and cutting ship pollution. 
The nations also pledged to support a global 
goal of protecting 30% of the ocean by 2030. 


Permit denied for Pebble Mine 


CONSERVATION | The U.S. Army Corps 

of Engineers on 25 November delivered a 
potentially fatal blow to plans for a contro- 
versial gold and copper mine in the Alaskan 
headwaters of the world’s largest sockeye 
salmon runs. Corps officials determined the 
Pebble Mine was “contrary to the public 
interest” and denied a federal permit needed 
to build it. It was a surprise victory for mine 
opponents, including environmentalists and 
Alaska Native groups. In July, the corps had 
issued an environmental study concluding 
the mine posed little threat to the salmon. 
But in recent months, some conserva- 

tives spoke against the project, including 
President Donald Trump’s son, Donald 
Trump Jr., who has fished for salmon there. 
Mine company Officials said they would 
appeal the decision. 


ESA nominates next director 


SPACE | Space scientist Josef Aschbacher 
of Austria has been nominated as the next 
director general of the European Space 
Agency. An ESA insider, Aschbacher has 
since 2016 led the agency’s highly success- 
ful Earth observation program, its largest 
directorate. If the nomination is approved 
by ESA’s 22 member states on 17 December, 
Aschbacher will take the reins of the 

€6.7 billion agency from the current direc- 
tor general, Jan Wörner, in mid-2021. 
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Progeria treatment approved 


DRUG DEVELOPMENT | Progeria, an ultra- 
rare and fatal genetic disease that causes 
features of premature aging, now has its 
first approved treatment in the United 
States: a drug called lonafarnib. Fewer than 
200 people worldwide are diagnosed with 
the condition, caused by a mutation in a 
gene for a protein that acts as a scaffold for 
cell nuclei. Defective forms of the protein 
accumulate in cells, causing young children 
to show signs of aging, including plaque 
buildup in the arteries. Most patients die 
from stroke, heart attack, or heart failure 

by their teens. Lonafarnib, developed by 
academic researchers and licensed by Eiger 
Biopharmaceuticals, curbs the abnormal 
protein buildup. It’s far from a cure, but after 
11 years of follow-up on 62 treated progeria 
patients in two small clinical trials, the drug 
extended expected life span by an average of 
2.5 years. The Food and Drug Administration 
approved the drug on 20 November for both 
progeria and a set of related conditions 
known as progeroid laminopathies. 


CONSERVATION 
Australia to count koalas 





Japan primatologist fired 


ANIMAL WELFARE | International 
researchers are defending renowned 

Kyoto University primatologist Tetsuro 
Matsuzawa, after his dismissal on 

24 November for the improper handling of 
funds for construction work at two chim- 
panzee facilities. The dismissal stems from 
a university investigation citing Matsuzawa 
and others for the unauthorized use of 
funds and inappropriate handling of 
contract procedures involving 500 million 
yen ($4.8 million) to build large, inter- 
connected cages at two primate research 
institutes. A separate investigation by 

the Board of Audit of Japan identified an 
additional 600 million yen also mishan- 
dled, though no funds were diverted for 
private use or to other projects. Matsuzawa 
says these conclusions reflect misunder- 
standings of the contracts with the two 
companies hired for the project. Other 
researchers say the firing seems harsh, 
given that all funds were spent building 
outstanding habitats. 


This koala 
is recovering 
from burns 
from a wildfire. 
=, 





fter massive bushfires last year killed thousands of koalas and incinerated 

up to 85% of their habitat in some regions, the Australian government has 
announced a $13 million effort to help these iconic animals bounce back. 
The spending includes $1.5 million for a population survey. Koalas are tricky to 
count, as they tend to hide in the treetops and spread out across a landscape, 


so this survey will involve heat-sensing drones, sound recordings, dogs that detect 
scat, and data collected by nonscientist observers. The program also allots 

$1.5 million to boost koala health research, because chlamydia and other diseases 
are the second largest recorded source of their mortality. 
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STRUCTURAL BIOLOGY 


‘The game has changed, 
Al triumphs at protein folding 


In milestone, software predictions finally match structures 
calculated from experimental data 


By Robert F. Service 


rtificial intelligence (AI) has solved 

one of biology’s grand challenges: pre- 

dicting how proteins fold from a chain 

of amino acids into 3D shapes that 

carry out life’s tasks. This week, orga- 

nizers of a protein-folding competition 
announced the achievement by research- 
ers at DeepMind, a U.K.-based AI company. 
They say the DeepMind method will have 
far-reaching effects, among them 
dramatically speeding the creation 
of new medications. 

“What the DeepMind team has 
managed to achieve is fantastic and 
will change the future of structural 
biology and protein research; says 100 
Janet Thornton, director emeritus 
of the European Bioinformatics In- 
stitute. “This is a 50-year-old prob- 
lem,’ adds John Moult, a structural 
biologist at the University of Mary- 
land, Shady Grove, and co-founder 
of the competition, Critical Assess- 
ment of Protein Structure Predic- 
tion (CASP). “I never thought Pd 
see this in my lifetime.” 
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The body uses tens of thousands of dif- 
ferent proteins, each a string of dozens to 
hundreds of amino acids. The order of the 
amino acids dictates how the myriad pushes 
and pulls between them give rise to proteins’ 
complex 3D shapes, which, in turn, determine 
how they function. Knowing those shapes 
helps researchers devise drugs that can lodge 
in proteins’ crevices. And being able to syn- 
thesize proteins with a desired structure 
could speed development of enzymes to 


Getting real 
At the Critical Assessment of Protein Structure Prediction (CASP) competition, 
AlphaFold matched experimental findings on a measure of accuracy. 


= = CASP14 (2020) 





AlphaFold (2020) 





Structures of a protein that were predicted 
by artificial intelligence (blue) and experimentally 
determined (green) match almost perfectly. 


make biofuels and degrade waste plastic. 

For decades, researchers deciphered pro- 
teins’ structures using experimental tech- 
niques such as x-ray crystallography or 
cryo-electron microscopy (cryo-EM). But 
such methods can take years and don’t al- 
ways work. Structures have been solved for 
only about 170,000 of the more than 200 mil- 
lion proteins discovered across life forms. 

In the 1960s, researchers realized if they 
could work out all interactions within a pro- 
tein’s sequence, they could predict its shape. 
But the amino acids in any given sequence 
could interact in so many different ways that 
the number of possible structures was astro- 
nomical. Computational scientists jumped 
on the problem, but progress was slow. 

In 1994, Moult and colleagues launched 
CASP, which takes place every 2 years. En- 
trants get amino acid sequences for about 
100 proteins whose structures are not 
known. Some groups compute a structure 
for each sequence, while others determine 
it experimentally. The organizers then com- 
pare the computational predictions with the 
lab results and give the predictions a global 
distance test (GDT) score. Scores above 
90 on the 100-point scale are considered on 
par with experimental methods, Moult says. 

Even in 1994, predicted structures for 
small, simple proteins could match experi- 
mental results. But for larger, challenging 
proteins, computations’ GDT scores were 
about 20, “a complete catastrophe,’ says 
Andrei Lupas, a CASP judge and evolution- 
ary biologist at the Max Planck Institute for 
Developmental Biology. By 2016, competing 
groups had reached scores of about 40 for the 
hardest proteins, mostly by drawing insights 
from known structures of proteins that were 
closely related to the CASP targets. 

When DeepMind first competed, in 2018, 
its algorithm, called AlphaFold, relied on this 
comparative strategy. But Alpha- 
Fold also incorporated a computa- 
tional approach called deep learn- 
ing, in which the software is trained 
on vast data troves—in this case, 
the sequences and structures of 
known proteins—and learns to spot 
patterns. DeepMind won handily, 
beating the competition by an aver- 
age of 15% on each structure, and 
winning GDT scores of up to about 
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60 for the hardest targets. 
But the predictions were still 
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Easy 


Difficulty of protein structure prediction 
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Difficult 


too coarse, says John Jumper, who 
heads AlphaFold’s development at 
DeepMind. “We knew how far we 
were from biological relevance.” So 
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the team combined deep learning with an 
“attention algorithm” that mimics the way a 
person might assemble a jigsaw puzzle: con- 
necting pieces in clumps—in this case clus- 
ters of amino acids—and then searching for 
ways to join the clumps in a larger whole. 
Working with a computer network built 
around 128 machine learning processors, 
they trained the algorithm on all 170,000 or 
so known protein structures. 

And it worked. In this year’s CASP, Alpha- 
Fold achieved a median GDT score of 92.4. 
For the most challenging proteins, AlphaFold 
scored a median of 87, 25 points above the 
next best predictions. It even excelled at solv- 
ing structures of proteins that sit wedged in 
cell membranes, which are central to many 
human diseases but notoriously difficult 
to solve with x-ray crystallography. Venki 
Ramakrishnan, a structural biologist at the 
Medical Research Council Laboratory of Mo- 
lecular Biology, calls the result “a stunning 
advance on the protein folding problem.” 

All groups in this year’s competition im- 
proved, Moult says. But with AlphaFold, 
Lupas says, “The game has changed.” The or- 
ganizers even worried DeepMind may have 
cheated somehow. So Lupas set a special 
challenge: a membrane protein from a spe- 
cies of archaea, an ancient group of microbes. 
For 10 years, his team had tried to get its x- 
ray crystal structure. “We couldn’t solve it.” 

But AlphaFold had no trouble. It returned 
a detailed image of a three-part protein with 
two helical arms in the middle. The model 
enabled Lupas and his team to make sense of 
their x-ray data; within half an hour, they had 
fit their experimental results to AlphaFold’s 
predicted structure. “It’s almost perfect,” 
Lupas says. “They could not possibly have 
cheated on this. I don’t know how they do it.” 

As a condition of entering CASP, Deep- 
Mind—like all groups—agreed to reveal suf- 
ficient details about its method for other 
groups to re-create it. That will be a boon 
for experimentalists, who will be able to 
use structure predictions to make sense of 
opaque x-ray and cryo-EM data. It could also 
enable drug designers to work out the struc- 
ture of every protein in new and dangerous 
pathogens like SARS-CoV-2, a key step in the 
hunt for molecules to block them, Moult says. 

Still, AlphaFold doesn’t do everything well. 
In CASP, it faltered on one protein, an amal- 
gam of 52 small repeating segments, which 
distort each others’ positions as they assem- 
ble. Jumper says the team now wants to train 
AlphaFold to solve such structures, as well as 
those of complexes of proteins that work to- 
gether to carry out key functions in the cell. 

Even though one grand challenge has 
fallen, others will undoubtedly emerge. “This 
isn’t the end of something,” Thornton says. 
“It’s the beginning of many new things.” 
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WATER POLLUTION 


Why were salmon dying? 
The answer washed off the road 


Common tire chemical implicated in coho salmon kills 


By Erik Stokstad 


or decades, something in urban 
streams has been killing coho salmon 
in the U.S. Pacific Northwest. Even 
after Seattle began to restore salmon 
habitat in the 1990s, up to 90% of the 
adults migrating up certain streams 
to spawn would suddenly die after rain- 
storms. Researchers suspected the killer 
was washing off nearby roads, but couldn’t 
identify it. “This was a serious mystery,” 
says Edward Kolodziej, an environmental 
engineer at the University of Washington’s 
(UW’s) Tacoma and Seattle campuses. 
Online this week in Science, researchers 
led by Kolodziej report the 
primary culprit comes from ý, if FM, 
a chemical widely used to JM Ay” hy, 
protect tires from ozone, a Y 
reactive atmospheric gas. 
The toxicant, called 6PPD- 
quinone, leaches out of the 
particles that tires shed 
onto pavement. Even small 
doses killed coho salmon 
in the lab. “Its a brilliant 
piece of work,’ says Miriam 
Diamond, an environmen- 
tal chemist at the Univer- 
sity of Toronto. “They’ve 
done a tremendous job at 
sleuthing out a very challenging problem.” 
Manufacturers annually produce some 
3.1 billion tires worldwide. Tire rubber is 
a complex mixture of chemicals, and com- 
panies closely guard their formulations. 
Because tire particles are a common com- 
ponent of water pollution, researchers have 
been examining how they affect aquatic life. 
After Kolodziej arrived at UW’s Center for 
Urban Waters in 2014, he joined the effort to 
solve the coho salmon mystery. The group 
created a mixture of particles from nine 
tires—some bought new, others provided by 
two undergraduates who moonlight as me- 
chanics—to mimic what might wash off typ- 
ical highways. They found several thousand 
unidentified chemicals in the mixture. Post- 
doc Zhenyu Tian spent more than 2 years 
narrowing down the list, separating the 
molecules based on their electrical charge 
and other properties. By May 2019, he had 
narrowed the focus to about 50 unknown 
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Particles that erode from tires wash 
into streams used by coho salmon. 





chemicals, and then further work revealed 
the chemical formula of a prime suspect. “If 
youre looking for an unexplained toxicant 
that’s killing fish, we had the perfect instru- 
ments and expertise,” Kolodziej recalls. 

But what was it? A 2019 report from 
the Environmental Protection Agency on 
chemicals in recycled tires mentioned 
6PPD, which has a similar formula. The 
final clue was buried in an industry re- 
port from 1983, which contained the exact 
formula of 6PPD-quinone, the molecule 
created when 6PPD reacts with ozone. 
The team synthesized 6PPD-quinone and 
found it was highly lethal to coho salmon. 

Now, the team is working to understand 
how the chemical kills fish. 
Kolodziej and colleagues 
say other species of fish 
should also be evaluated 
for sensitivity. Because you 
can’t buy the molecule, 
Kolodziej’s team is making 
it. “My lab might even be 
the only place that actually 
has this,” he says. 

The researchers suspect 
the compound is pres- 
ent on busy roads every- 
where. They’ve found it 
washes off pavement and 
into streams in Los An- 
geles and San Francisco, for example. The 
simplest solution might be for tire manu- 
facturers to switch to an environmentally 
benign alternative. But Sarah Amick, vice 
president of environment, health, safety, 
and sustainability at the U.S. Tire Manu- 
facturers Association, says it’s too early to 
discuss alternatives. “It’s important that 
additional research be done to validate and 
verify these results.” 

Another way to protect salmon is to fil- 
ter stormwater through soil, but installing 
enough infiltration basins to treat road 
runoff before it reaches spawning streams 
would be very expensive, says co-author 
Jenifer McIntyre, an ecotoxicologist at 
Washington State University’s Puyallup 
Research and Extension Center. In the 
meantime, Kolodziej says he “can’t walk 
along a street without staring at all the 
skid marks,” thinking about tire chemicals, 
and “wondering what’s there.” 
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EUROPE 


Scientists fear no-deal Brexit as deadline looms 


U.K. researchers in limbo, weeks from crashing out of EU trade and travel rules 


By Cathleen O’Grady 


rexit is set to become a reality, but 

its likely consequences for research- 

ers are still emerging. The United 

Kingdom left the European Union on 

31 January, but has remained part of 

EU trade and travel agreements while 
the final Brexit deal is negotiated. On 1 Janu- 
ary 2021, those ties will be severed—and a 
Brexit deal has yet to materialize. 

The future of research funding, interna- 
tional collaboration, and wrinkle-free supply 
chains of lab stocks hinge on the details of the 
final deal, but those issues are “some 
way back in the queue” behind the 
sticking points of a trade agreement, 
says James Wilsdon, a science policy 
expert at the University of Sheffield. 

For U.K. scientists, the biggest ques- 
tion is whether they can be a part of the 
€85 billion 2021-27 Horizon Europe re- 
search funding program. Non-EU mem- 
bers can participate, but the United 
Kingdom is wavering over a potentially 
hefty price tag. The EU offer would 
see the United Kingdom pay in about 
£15 billion, plus a top-up payment if 
U.K. applicants win more than that in 
research grants. But because U.K. suc- 
cess rates in winning Horizon grants 
have fallen by almost one-third since 
the vote to leave Europe in 2016, it is 
likely to pay in much more than it gets 
out. Vivienne Stern, director of Univer- 
sities UK International, estimates the 
7-year premium at about £3 billion. Al- 
though Stern supports joining Horizon 
Europe, she told a parliamentary com- 
mittee on 22 October the price is too high. 

The benefits of international collaboration 
and access to diverse funding schemes could 
justify paying extra, but not such an “eye- 
watering” sum, says Kieron Flanagan, a 
science policy expert at the University of 
Manchester. U.K. research advocates have 
proposed a cap on any premium paid into 
Horizon Europe by non-EU countries, but the 
European Union has shown no sign of budg- 
ing since its initial offer in March, says Martin 
Smith, a policy manager at the Wellcome 
Trust, a U.K. philanthropic research funder. 
“Tm currently optimistic that a way forward 
on costs can be found,” he says, “but all this 
is moot if the wider negotiations collapse.” 
Wilsdon is more pessimistic and suggests Ho- 
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rizon participation could stall even if a trade 
deal squeaks through. 

Another issue that could disrupt research 
collaborations is data privacy. EU negotia- 
tors may deem U.K. data protection laws 
inadequate because of its broad use of sur- 
veillance, says Rosie Richards, head of digi- 
tal policy at the NHS Confederation, which 
represents U.K. health providers. That 


could disrupt studies including those from 
the COVID-19 Genomics UK Consortium, 
which is working with the European Bio- 
informatics Institute to track clinical data 
and changes in the coronavirus genome to 





establish whether they link to easier trans- 
mission of the virus or more severe disease. 

Less data sharing will also make it harder 
for the U.K. Medicines and Healthcare prod- 
ucts Regulatory Agency (MHRA) to approve 
drugs. The agency was a major player in the 
European Medicines Agency, which relocated 
from London to Amsterdam in 2019. In Janu- 
ary 2021, MHRA will go its own way, and it is 
likely to have a heftier workload looking at 
coronavirus drugs and vaccines, says Olivier 
Wouters, a health policy researcher at the 
London School of Economics. Brexit, he says, 
“couldn’t come at a more inopportune time.” 

New customs regulations could disrupt 
the flow of critical university lab supplies, 
says Helen Dodd-Williams, head of strategic 
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procurement at the North Western Univer- 
sities Purchasing Consortium (NWUPC). A 
supply gap in some stocks—such as the liquid 
helium used to chill MRI machines—would 
cause severe disruption, she says, while prod- 
ucts like antibodies have a short shelf life and 
can’t be stored long in advance. Suppliers are 
developing contingency plans, such as rout- 
ing lab supplies through other ports to avoid 
expected logjams in southeastern England. 
No matter what deal emerges, Brexit 
is sure to change the flow of researchers 
themselves. EU citizens wishing to work in 
the United Kingdom will now need a visa, 
with requirements for a job offer, a 
salary above a certain threshold, and 
English language testing. The U.K. 
government is offering a new “global 
talent visa” that eliminates some 
bureaucratic hoops for researchers 
and technicians named on grants— 
including ones from Horizon Europe. 
But the cost of a 5-year visa for a fam- 
ily of four is nearly £15,000. “It’s re- 
ally quite prohibitively expensive,” 
and might persuade researchers to 
land elsewhere in Europe, says James 
Tooze, a policy officer at the U.K. Cam- 
paign for Science and Engineering. 
The visa requirements will also af- 
fect essential university workers, like 
building cleaners. NWUPC’s suppliers 
are helping European employees with 
the paperwork that allows pre-Brexit 
residents to remain, but in the future 
they might not be able to recruit that 
labor so easily, Dodd-Williams says. 
Brexit talks are still ongoing, with 
stubborn disagreements over rules 
that would prevent businesses on either 
side having an unfair advantage because of 
labor and environmental standards, or state 
subsidies. But even if a deal is struck and 
ratified by both sides before the end of the 
year, many details will linger. If the United 
Kingdom does not join Horizon Europe 
now, it could still choose to join months or 
years down the line. And the fallout from 
the new immigration restrictions and cus- 
toms bureaucracy will take time to unfold. 
Flanagan says Brexit means a perma- 
nently different relationship between the 
United Kingdom and Europe that will de- 
mand constant shifts and negotiations. 
“This is what people don’t really under- 


” E 


stand,” he says. “It will never be over. 
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DEVELOPMENTAL GENETICS 


How cats 
set their 
stripes 
and spots 


Newly uncovered 
mechanism 

may pattern 

the coats of other 
mammals, too 


By Elizabeth Pennisi 


hen Rudyard Kipling told how 
the leopard got his spots, he 
missed the mark. Leopards have 
“rosettes”; spots are for cheetahs, 
says Gregory Barsh, a geneticist 
at the HudsonAlpha Institute 
for Biotechnology. But whatever you call 
the markings, how wild cats and their do- 
mestic counterparts acquire them has long 
been a mystery. Now, Barsh and his col- 
leagues have found an answer. In so doing, 
they have shown that a 70-year-old theory 
explaining patterns in nature holds true 
for fur color in cats, and likely other mam- 
mals as well. 

“This is an important paper unveiling 
part of the genetic basis [of] coat color 
markings so prominent in many mam- 
mals,” says Denis Headon, a developmen- 
tal biologist at the Roslin Institute. It also 
offers a glimpse of how those genes oper- 
ate during development, forming what he 
calls a “highly adaptable mechanism” that 
responds to genetic tweaks to produce di- 
verse coat patterns, from stripes to spots. 

Biologists have identified hair follicle 
cells as the source of the black, brown, yel- 
low, and red pigments that color hair or 
fur. “But we didn’t know when and where 
the process of the establishment of the 
color pattern took place,’ Barsh says. 

In 1952, computing pioneer Alan Turing 
suggested molecules that inhibit and 
activate each other could create peri- 
odic patterns in nature if they diffused 
through tissue at different rates. Thirty 
years later, other scientists applied his 
theory to develop a hypothesis about 
how spots, stripes, and other color pat- 
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terns form during development. In this 
scheme, activator molecules color a cell 
but also trigger the production of inhibi- 
tors, which diffuse faster than the activa- 
tors and can shut off pigment production. 
Last year, that idea was proved correct in 
plants called monkeyflowers: Research- 
ers showed that dark, activated speck- 
les on the petals become ringed with 
unpigmented tissue as inhibitors spread 
(Science, 30 August 2019, p. 854). And re- 
searchers had shown molecules following 
the Turing pattern help trigger the devel- 
opment of hair follicles in mice. But how 
coat color develops in mammals remained 
largely mysterious because mice and 
other easy-to-study lab animals lack spots 
or stripes. 

So Barsh’s team turned to domestic cats 
to track the identity of molecular activators 
and inhibitors of coat color. A decade ago, 
they tracked down a gene, Tabby, that, when 
mutated, gives tabby cats black blotches in- 
stead of their usual dark stripes. Hudson- 
Alpha geneticist Christopher Kaelin 
found that same mutation in king chee- 
tahs whose spots were unusually big and 
blotchy, suggesting the same genes color 
both wild and domestic cats. 

To see what other genes and their mu- 
tations operate during development, 
Kaelin and HudsonAlpha colleague Kelly 
McGowan spent several years collecting 
discarded tissue from clinics that spay fe- 
ral cats, which are often pregnant. They 
first noticed temporary thickenings of the 
skin of 28- to 30-day-old embryos, where 
black stripes would later appear in the 
fur. “There’s a change [in the skin] that 
precedes and mimics what you observe in 
adult [fur]; McGowan explains. 
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A mutated gene gives this tabby blotchy spots 
instead of the typical stripes. 


The researchers then isolated and se- 
quenced the active genes in individual skin 
cells of early embryos. At about 20 days old, 
embryos showed a sharp increase in the ac- 
tivity of several genes involved in a key devel- 
opmental pathway, known as Wnt signaling, 
in skin areas destined to briefly thicken be- 
fore the area becomes permanently dark. 
One of the most active genes was Dkk4, as 
they reported on 16 November in a preprint 
on bioRxiv. The team also found that muta- 
tions that inactivated Dkk4 accounted for 
the loss of distinct markings in Abyssinian 
and Singpura breeds, making their spots 
too small to distinguish. Tabby and Dkk4 
“are in the same pathway,’ and likely work 
in both domestic and wild cats, Barsh ex- 
plains, though he doesn’t yet know how 
they are connected. 

Dkk4 is a known inhibitor of Wnt sig- 
naling, which helps determine cell fates 
and spurs cell growth during development 
in many animals. The team found that in 
domestic cats, Wnt and Dkk4, respectively, 
are the activator and inhibitor. In dark 
skin, they exist in about equal amounts. 
But in paler areas, the faster moving Dkk4 
protein most likely turns off Wnt, shutting 
down pigment production and thereby 
generating stripes, just as Turing’s theory 
had predicted. “It is remarkable, although 
not altogether surprising, that we see Wnt- 
Dkk4 signaling again playing a critical early 
role,” says Larissa Patterson, a developmen- 
tal biologist at Rhode Island College. 

“This paper provides thought-provoking 
insights into potential mechanisms of pat- 
tern diversity in wild cats,” Patterson adds. 
It “greatly adds to the evidence” that this 
process is at work in cats and, most likely, 
other mammals, agrees Roland Baddeley, 
a computational neuroscientist at the Uni- 
versity of Bristol. 

Researchers had already shown the 
Turing mechanism involving Wnt and Dkk4 
sets up the formation of hair follicles—but 
not coat color—later in mouse develop- 
ment. Barsh’s team, however, found that 
the color pattern in cats and possibly other 
mammals is established well before hair 
follicles appear, suggesting early color pat- 
terns may guide hair follicle pigmentation. 

That simple interactions among well- 
known molecules can explain the variety 
of coat color patterns in mammals is an ex- 
ample of nature’s thriftiness, Headon says. 
“It suggests that the same molecules and 
pathways are likely to be reused for pat- 
terning of very different structures and at 
very different scales to form the intricate 
elements of the vertebrate anatomy.” 
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Federal hospital data system 
falters at tracking pandemic 


HHS Protect data, which influence how pandemic supplies 
and support are allocated, conflict with other data sources 


By Charles Piller 


n mid-November, as the United States set 

records for newly diagnosed COVID-19 

cases day after day, the hospital situa- 

tion in one hard-hit state, Wisconsin, 

looked concerning but not yet urgent by 

one crucial measure. The main pandemic 
data tracking system run by the Depart- 
ment of Health and Human Services (HHS), 
dubbed HHS Protect, reported that on 16 No- 
vember, 71% of the state’s hospital beds were 
filled. Wisconsin officials who rely on the 
data to support and advise their increasingly 
strained hospitals might have concluded 
they had some margin left. 

Yet a different federal COVID-19 data sys- 
tem painted a much more dire picture for 
the same day, reporting 91% of Wisconsin’s 
hospital beds were filled. That day was no 
outlier. A Science examination of HHS Pro- 
tect and confidential federal documents 
found the HHS data for three important 
values in Wisconsin hospitals—beds filled, 
intensive care unit (ICU) beds filled, and in- 
patients with COVID-19—often diverge dra- 
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matically from those collected by the other 
federal source, from state-supplied data, and 
from the apparent reality on the ground. 
“Our hospitals are struggling,” says Jeffrey 
Pothof, a physician and chief quality officer 
for the health system of the University of 
Wisconsin (UW), Madison. During recent 
weeks, patients filled the system’s COVID-19 
ward and ICU. The university’s main hospital 
converted other ICUs to treat the pandemic 
disease and may soon have to turn away 
patients referred to the hospital for special- 
ized care. Inpatient beds—including those in 
ICUs—are nearly full across the state. “That’s 
the reality staring us down,” Pothof says, add- 
ing: The HHS Protect numbers “are not real.” 
HHS Protect’s problems are a national 
issue, an internal analysis completed this 
month by the Centers for Disease Control 
and Prevention (CDC) shows. (The report is 
available at https://scim.ag/36saba3.) That 
analysis, other federal reports, and emails 
obtained by Science suggest HHS Protect’s 
data do not correspond with alternative hos- 
pital data sources in many states. “The HHS 
Protect data are poor quality, inconsistent 
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COVID-19 wards like this one at University of 
Wisconsin Health in Madison are filling up fast, 
but federal data may not show that. 


with state reports, and the analysis is slip- 
shod,” says one CDC source who had read the 
agency’s analysis and requested anonymity 
because of fear of retaliation from the Trump 
administration. “And the pressure on hospi- 
tals [from COVID-19] is through the roof.’ 

Both federal and state officials use HHS 
Protect’s data to assess the burden of dis- 
ease across the country and allocate scarce 
resources, from limited stocks of COVID-19 
medicines to personal protective equipment 
(PPE). Untrustworthy numbers could lead to 
supply and support problems in the months 
ahead, as U.S. cases continue to rise during 
an expected winter surge, according to cur- 
rent and former CDC officials. HHS Protect 
leaders vigorously defend the system and 
blame some disparities on inconsistent state 
and federal definitions of COVID-19 hospi- 
talization. “We have made drastic improve- 
ments in the consistency of our data ... even 
from September to now,” says one senior 
HHS official. (Three officials from the de- 
partment spoke with Science on the condi- 
tion that they not be named.) 

CDC had a long-running, if imperfect, hos- 
pital data tracking system in place when the 
pandemic started, but the Trump administra- 
tion and White House Coronavirus Task Force 
Coordinator Deborah Birx angered many in 
the agency when they shifted much of the 
responsibility for COVID-19 hospital data 
in July to private contractors. TeleTracking 
Technologies Inc., a small Pittsburgh-based 
company, now collects most of the data, while 
Palantir, based in Denver, helps manage the 
database. At the time, hundreds of public 
health organizations and experts warned the 
change could gravely disrupt the ability to 
understand the U.S. pandemic and mount a 
response (Science, 23 October, p. 394). 

The feared data chaos now seems a real- 
ity, evident when recent HHS Protect figures 
are compared with public information from 
states or data documented by another hos- 
pital tracking system run by the HHS Office 
of the Assistant Secretary for Preparedness 
and Response. ASPR manages the Strategic 
National Stockpile of medicines, PPE—in 
perilously short supply in many areas—and 
other pandemic necessities. ASPR collects 
data nationwide to help states and hospitals 
respond to the pandemic, although it is less 
comprehensive than HHS Protect. 

In Alabama, HHS Protect figures differ by 
15% to 30% from daily state COVID-19 in- 
patient totals. Karen Landers, assistant state 
health officer, said nearly all of the state’s 
hospitals report data to HHS via the Ala- 
bama Department of Public Health. Although 
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reporting delays sometimes prevent the sys- 
tems from syncing precisely, Landers says, 
she cannot account for the sharp differences. 

Many state health officials contacted by 
Science were reluctant to directly criticize 
HHS Protect or attribute supply or support 
problems to its data. Landers notes that 
Alabama relies on its own collected data, 
rather than HHS Protect’s, for its COVID-19 
response. “We are very confident in our 
data,” she says, because the state reporting 
system was developed over several years and 
required little adjustment to add COVID-19. 
HHS, she adds, has generally been respon- 
sive to state requests for medicines and 
supplies, although Alabama has not always 
gotten all the PPE it has requested. 

Other states, however, say they do rely 
on HHS Protect. A spokesperson for the 
Wisconsin Department of Health Ser- 
vices wrote in a response to questions, 
“When making decisions at the state 
level we use the HHS Protect data,’ but 
declined to comment about its accuracy. 
HHS informed Wisconsin officials that 
it distributes scarce supplies based on 
need indicated by HHS Protect data, the 
spokesperson wrote. 

Pothof says UW’s hospital system has 
its own sophisticated data dashboard 
that draws on state, local, and internal 
sources to plan and cooperate on pan- 
demic response with other hospitals. But 
small hospitals in Wisconsin—now expe- 
riencing shortages of some medicines, 
PPE, and other supplies—are more de- 
pendent on federal support largely based 
on HHS Protect data. Help might not ar- 
rive, Pothof says, if the data show “things 
look better than they are.” 

If the HHS Protect data are suspect, 
“that’s a very large problem,” says Nancy 
Cox, former director of CDC’s influenza 
division and now an affiliated retiree 
of the agency. If HHS officials use bad 
data, they will not distribute medicines 
and supplies equitably, Cox notes, add- 
ing: “Undercounting in the hardest hit 
states means a lower level of care and 
will result in more severe infections and 
ultimately in more deaths.” 

Birx and the other managers of HHS Pro- 
tect “really had no idea what they were doing,” 
says Tom Frieden, CDC director under former 
President Barack Obama. (Birx declined to 
comment for this article.) Frieden cautions 
that ASPR data might also be erroneous— 
pointing to the need for an authoritative 
and clear federal source of hospital data. 
The original CDC system, called the National 
Healthcare Safety Network (NHSN), should 
be improved, he said, but it handles nursing 
home COVID-19 data skillfully and could do 
the same with hospitals. NHSN is “not just a 
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reliable results. 


Conflicting bed counts 
Two Department of Health and Human Services (HHS) 


sections collect hospital patient data to battle the COVID-19 
pandemic—HHS Protect and the Office of the Assistant 
Secretary for Preparedness and Response (ASPR). Their data 
sometimes conflict sharply, as shown in this 16 November 
summary of intensive care unit (ICU) beds occupied in six 
hard-hit Midwestern states. The difference noted parentheti- 


cally is ASPR’s bed total minus HHS Protect’s. 
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computer program. It’s a public health pro- 
gram” built over 15 years and based on re- 
lationships with individual health facilities, 
Frieden says. (CDC insiders say HHS officials 
recently interfered with publication of an 
analysis showing that NHSN performed well 
early in the pandemic. See https://scim.ag/ 


An HHS official says HHS Protect’s data 
are complex and the department can’t ver- 
ify any findings in the reports reviewed by 
Science without conducting its own analysis, 
which it did not do. But the official says HHS 
Protect has improved dramatically in the 
past 2 months and provides consistent and 
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As for the difference between state and 
HHS Protect data, an HHS official con- 
tends state numbers “are always going to 
be lower” by up to 20%. That’s because hos- 
pitals could lose Medicare funding if they 
do not report to HHS, the official says, but 
face no penalty for failing to report to the 
state. So rather than expect identical num- 
bers, HHS looks for state and federal data 
to reflect the same trajectory—which they 
do in all cases for the number of COVID-19 
inpatients, according to another confiden- 
tial CDC analysis of HHS Protect, covering 





Yet the same analysis found 27 states re- 
cently alternated between showing more 
or fewer COVID-19 inpatients than HHS 
Protect—not always just fewer, as HHS 
says should be the case. Thirty states also 
showed differences between state and HHS 
Protect figures that were frequently well 
above the 20% threshold cited by HHS, 
and HHS Protect data fluctuated errati- 
cally in 21 states. 

“Hospital capacity metrics can and 
should be a national bellwether,” the CDC 
data expert says. “One important question 
raised by the discordant data reported by 
HHS Protect and the states is whether 
HHS Protect is systematically checking 

data validity.” HHS has not provided 
the methodology behind HHS Protect 
data estimates for review by indepen- 
dent experts. But an HHS official says a 
team of data troubleshooters, including 
CDC and ASPR field staff, work to re- 
solve anomalies and respond to spikes 
in cases in a State or hospital. 

Along with improving trust in its 
data, HHS Protect needs to make it 
more accessible, CDC data scientists 
say. The publicly accessible HHS Protect 
data are far less complete than the fig- 
ures in its password-protected database. 
This effectively hides from public view 
key pandemic information, such as local 
supplies of protective equipment. 

The site also does not provide graph- 
ics highlighting patterns and trends. 
This might explain, in part, why 
most media organizations—as well as 
President-elect Joe Biden’s transition 
team—instead have relied on state or 
county websites that vary widely in com- 
pleteness and quality, or on aggregations 
such as The Atlantic magazine’s COVID 
Tracking Project, which collects, orga- 
nizes, and standardizes state data. (In 
comparing state and federal data, CDC 
also used the COVID Tracking Project.) 

Frieden and other public health spe- 
cialists call reliable, clear federal data 
essential for an effective pandemic re- 
sponse. “The big picture is that we’re 

coming up to 100,000 hospitalizations 
within the next few weeks. Hospital sys- 
tems all over the country are going to be 
stressed,” Frieden says. “There’s not going 
to be any cavalry coming over the hill from 
somewhere else in the country, because 
most of the country is going to be over- 
whelmed. We’re heading into a very hard 
time with not very accurate information 
systems. And the government basically un- 
dermined the existing system.” 





This story was supported by the Science Fund 
for Investigative Reporting. 
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New challenges emerge for 
planned human challenge trials 


Scientists debate risks and benefits of intentionally 
exposing volunteers to dangerous virus 


By Warren Cornwall 


n the early days of the COVID-19 pan- 

demic, tens of thousands of young volun- 

teers offered to risk their health by letting 

scientists intentionally infect them with 

the pandemic coronavirus, hoping to 

speed the hunt for a vaccine or treatment. 
Several research groups announced plans to 
run these so-called human challenge trials, 
even as some scientists questioned whether 
they could be conducted ethically. 

Now, with recent news that conventional 
human trials have produced several promis- 
ing vaccines, scientists are debating whether 
planned challenge trials are still needed. In 
the United States, one nascent effort is on 
hold. In the United Kingdom, however, re- 
searchers say they are moving ahead. “There 
are still many strong arguments for pursu- 
ing” challenge trials, says immunologist 
Christopher Chiu of Imperial College Lon- 
don, leader of the proposed U.K. trial. 

In traditional human trials, researchers 
give volunteers either a vaccine or a placebo, 
then wait months or longer for enough cases 
to emerge to gather statistically rigorous re- 
sults. Challenge trials can move faster by first 
vaccinating volunteers and then intentionally 
exposing them to SARS-CoV-2 in a controlled 
setting. Researchers would then closely mon- 
itor the volunteers for days or weeks. 

But challenge trials are also rife with ethi- 
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cal concerns. For example, because there are 
no reliable treatments for severe COVID-19, 
people who volunteer to test a vaccine could 
die if they get sick. So, medical authorities 
around the world have adopted a cautious 
approach. Even 1 year after the outbreak be- 
gan, no such trial is underway. Meanwhile, 
11 vaccines have entered the final phase of 
conventional human testing, with 





Earlier challenge trials deliberately infected 
volunteers with flu virus and other pathogens. 


approve, scientists could infect the first vol- 
unteers in January 2021. The first studies, 
involving up to 100 volunteers ages 18 to 30, 
would identify the lowest viral dose needed 
to cause an infection. Then they could move 
forward with head-to-head comparisons of 
different vaccine candidates. Although the 
experiment won't replace traditional efficacy 
trials, it could highlight which vaccines war- 
rant further investment, Chiu says. 

Chiu is comfortable with intentionally 
infecting people with SARS-CoV-2 because 
there is growing evidence that young adults 
have a low risk of serious illness. He con- 
sulted with researchers at King’s College 
London who help manage an app tracking 
COVID-19 symptoms for more than 4 mil- 
lion people. Among 650 young adults with 
confirmed cases, 90% had no symptoms after 
3 weeks, and the symptoms of a few “outliers” 
resolved after a few months, Chiu says. 

In the United States, however, the en- 
couraging recent vaccine results have put a 
damper on human challenge trials. A month 
ago, virologist Matthew Memoli put the final 
touches on plans for a challenge trial at the 
National Institute of Allergy and Infectious 
Diseases (NIAID). But Anthony Fauci, head 
of NIAID, and Francis Collins, head of the Na- 
tional Institutes of Health, recently suggested 
the experiment is no longer necessary. 

That’s welcome news to some, who fear 
that challenge trials present serious risks to 
volunteers. Others, however, say the apparent 
success of some vaccines could strengthen 
the case for challenge trials. Once viable vac- 

cines are available, it’s harder to 


promising results already emerg- az justify continuing with traditional 
ing: Four vaccines have shown up Science's trials, in which thousands of people 
to 95% efficacy (see p. 1151). COVID-19 are given a placebo while others are 

Proponents say challenge experi- reporting Is vaccinated, says bioethicist Nir Eyal 
ments could still advance science ee eel nh of Rutgers University, New Bruns- 
and save lives. They could be used andthe wick. “Are you going to hope that 
to compare the effectiveness of dif- Heising-Simons tens of thousands of people will 
ferent vaccines, for instance, and Foundation. altruistically be willing to forgo ac- 


help quickly screen the most prom- 

ising second-generation candidates from the 
dozens now in the works. Challenge studies 
could also help answer unknowns, such as 
whether vaccines block infection entirely or 
simply prevent people from getting sick. 

It’s hard to use large trials to determine 
whether a vaccinated person still sheds the 
virus, says epidemiologist Marc Lipsitch 
of Harvard University. “It’s almost pro- 
hibitively expensive because in a 30,000- 
person trial you have to test 30,000 people 
a week,” he says. 

The initiative Chiu heads, the Human 
Challenge Consortium, is backed by £33 mil- 
lion from the U.K. government. If regulators 
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cess to a proven vaccine?” he asks. 

Seema Shah, a bioethicist at Northwest- 
ern University, thinks human challenge trials 
could still be useful, but there is less urgency 
now. And she would like to see more ways 
to ensure participants’ safety, such as find- 
ing biological markers for people at greater 
risk of getting sick. “I think I would feel more 
comfortable waiting a bit to see how those 
things developed and whether the risk could 
be minimized further,’ she says. 

Lipsitch agrees that “the immediate 
need is not great” for human challenge tri- 
als. But, he adds, researchers “need to keep 
preparing in case we have reason to start 
doing them.” 
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COVID-19 


Amid the cheering, some vaccines face questions 


A candidate from AstraZeneca and the University of Oxford has puzzling efficacy data 


By Jon Cohen 


he drumbeat of good news about 

COVID-19 vaccines has continued, 

albeit with some dissonant notes. 

By now four makers have reported 

results ranging from encouraging to 

stunningly positive from phase III 
clinical trials. The announcements, deliv- 
ered by press release, have left scientists 
hungering for more data, but have per- 
suaded many that having a safe and effective 
vaccine against SARS-CoV-2, the pandemic 
coronavirus, has moved from a question 
of if to when and how. “It’s very 
good to have positive results from 
different platforms, says Ana 
Maria Henao Restrepo, who leads 
an effort at the World Health Or- 
ganization to stage COVID-19 vac- 
cine comparisons. 

At the same time, two of the 
four COVID-19 vaccines face more 
questions than the other pair. 
AstraZeneca and the University 
of Oxford promoted their candi- 
date, which uses an adenovirus 
to deliver the gene for the spike 
protein of SARS-CoV-2, as being 
nearly as protective as vaccines 
from Moderna and Pfizer and 
BioNTech. Those vaccines, which 
rely on messenger RNA (mRNA) 
to deliver the genetic code for 
spike, prevented symptomatic disease with 
95% efficacy and appeared to block serious 
COVID-19 almost entirely. 

But the AstraZeneca/Oxford announce- 
ment last week came from an interim 
analysis blending two trials of the vaccine 
in which people received different doses— 
apparently by accident. Efficacy ranged 
from 62% to 90%, depending on the dosing 
strategy. Meanwhile, Russia’s Gamaleya Re- 
search Institute of Epidemiology and Micro- 
biology, which also uses adenoviruses as 
vectors to deliver spike’s gene, claimed 
91.4% efficacy based on a second interim 
analysis of its candidate, Sputnik V. But that 
rested on far fewer COVID-19 cases than 
Moderna or Pfizer and BioNTech reported 
and the Gamaleya trial is still ongoing. 

The AstraZeneca/Oxford effort con- 
ducted initial efficacy trials in the United 
Kingdom and Brazil. Science has learned 
from a scientist working with the partner- 
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ship who asked not to be named that a cal- 
culation error early in the U.K. study led to 
vaccine vials being filled with half the in- 
tended dose. Rather than drop people who 
had already received that shot, the team 
modified the trial protocol to compare peo- 
ple who received a half dose followed by a 
full dose with those given two full doses. 
An AstraZeneca official acknowledged the 
half dosing was a mistake, but Oxford re- 
searchers challenged this, and neither side 
would respond to requests to further discuss 
the issue. “It’s a head scratcher on a num- 
ber of levels, and this is the worst aspect of 
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AstraZeneca says it can produce at least 3 billion doses of its COVID-19 
vaccine next year. 


science by press release,” says immunologist 
John Moore of Weill Cornell Medicine. 

The collaboration reported people who 
received the half-dose priming shot were 
best protected from symptomatic COVID-19. 
But Moncef Slaoui, chief scientist of the 
U.S. government’s Operation Warp Speed, 
which has invested heavily in the vac- 
cine, notes that this substudy excluded 
elderly people. 

The leading hypothesis for why vari- 
ous groups responded differently to the 
AstraZeneca/Oxford vaccine depending on 
dosing is an immune response against the 
adenovirus vector. With both that vaccine 
and the Russian one, volunteers received 
a “prime” first dose and a booster shot 
at least 1 month later. But the Gamaleya 
scientists used different adenoviruses for 
the prime and boost, so that an immune 
response to the first adenovirus would not 
thwart the booster shot. AstraZeneca and 


Published by AAAS 








Oxford used the same adenovirus vector 
for both shots, which could explain why 
the larger boost dose produced the weaker 
result in its trial. “I would bet on that be- 
ing a contributor, but not the whole story,” 
says Adrian Hill, director of Oxford’s Jen- 
ner Institute, which designed the vaccine. 

Moore says the vaccines’ versions of the 
spike protein may also matter. In nature, 
viral surface proteins can “wobble.” His 
group in 2002 added stabilizing muta- 
tions to HIV’s surface protein, which other 
groups later showed led to more robust an- 
tibody responses when it was injected into 
animals. Moderna, Pfizer and Bi- 
oNTech, and other COVID-19 vac- 
cinemakers have added similar 
mutations to spike. Hill’s team 
did not. “Oxford is out on a limb,” 
Moore says. 

In the United States, Warp 
Speed has enrolled about 11,000 
of a planned 40,000 participants 
in a third efficacy trial of the As- 
traZeneca/Oxford vaccine that 
uses the full-dose prime. The 
study may obtain an efficacy sig- 
nal before being fully enrolled 
or it could switch to a half-dose 
prime, Slaoui adds, if researchers 
tease out the factors that led to 
90% protection. 

Henao Restrepo says it’s impor- 
tant to factor in that the Astra- 
Zeneca/Oxford vaccine is relatively cheap, 
about $3 per dose, and only needs refrigera- 
tion temperatures for storage. The mRNA 
vaccines will cost at least $20 per dose and 
must be kept at subzero temperatures. As- 
traZeneca also says it can produce at least 
3 billion doses next year, far more than pro- 
jections from any other manufacturer. 

A U.S. Food and Drug Administration 
vaccine advisory group plans to publicly 
review applications for emergency use au- 
thorization from Pfizer and BioNTech on 
10 December and from Moderna 1 week 
later. The candidate from Pfizer and BioN- 
Tech appears poised to receive similar U.K. 
authorization as soon as this week. 

But Hill says the COVID-19 vaccine race 
isn’t over. “If we provide more vaccine next 
year, particularly in places where there’s a 
lot of disease like Brazil and India, and save 
more lives, that’s winning to us—not having 
done the press release first,” he says. 
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SAVING SANCTUARIES 


Critics charge that sanctuaries for retired research 
chimpanzees are failing their animals. Can a new tool help? 


n a chilly morning in early Oc- 
tober, primatologist Steve Ross 
drives up to a chain-link gate 
blocking a narrow dirt road in 
northern Georgia. He steps out of 
his car into a swath of the Blue 
Ridge Mountains and watches 
the cresting Sun illuminate low 
fog on the wooded hills. “When 
I went to Tanzania,” he says, thinking back 
on the first time he saw wild chimpanzees, 
“It was just like this.” 
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By David Grimm, 
in Morganton, Georgia 


A green sign on the gate reveals why Ross 
is here: “Project Chimps,” it reads. “Provid- 
ing Lifelong Sanctuary to Chimpanzees Re- 
tired from Research.” 

The refuge is one of about a half-dozen 
chimp sanctuaries in North America, all 
designed to give the primates a more nat- 
ural life than they had in labs, homes, or 
the entertainment industry. Founded just 
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6 years ago, Project Chimps made a name for 
itself by promising to retire more than 
200 chimpanzees—about one-third of all 
ex-research chimps at the time—from a 
major biomedical facility. 

Yet, since April, about a dozen former 
workers have alleged that the sanctuary is 
jeopardizing the welfare of its apes with 
inadequate veterinary care, insufficient 
outdoor access, and failing infrastructure. 
Similar charges have been leveled against 
two other major chimp sanctuaries, rais- 
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Primatologist Steve Ross begins his 
assessment at Project Chimps. 


ing concerns about nearly half of the North 
American system. 

“I think sanctuaries are going to be ques- 
tioned very seriously going forward,” says 
Cindy Buckmaster, chair of the board of di- 
rectors of Americans for Medical Progress. 
When the U.S. government decided to retire 
its chimpanzees in 2015, she and others in 
the biomedical community predicted these 
problems, saying animals would actually 
fare worse in sanctuaries. The facilities 
have taken on too many chimps too quickly, 
Buckmaster argues, and lack the resources 
and experience to properly care for them. 

Ross is here to see whether that’s true. 
The director of ape study and conservation 
at Chicago’s Lincoln Park Zoo, he’s a pro- 
ponent of sanctuaries and the board chair 
of the world’s largest, Louisiana’s Chimp 
Haven. He’s also one of the country’s most 
respected chimp experts and has published 
dozens of studies on how to best care for the 
animals in captivity. By his own account, he 
has visited every chimp facility in North 
America, from tiny roadside zoos to giant 
biomedical operations. “As long as it wasn’t 
born in the past 5 years,” he says, “I’ve met 
that chimp.” 

Today, Ross has arranged to arrive un- 
announced at Project Chimps to try out a 
new tool he’s developing to assess chimpan- 
zee welfare. Based on what the latest science 
says these animals need, the approach con- 
siders everything from the size of chimp so- 
cial groups to the thickness of their bedding. 
The result: a comprehensive report of what 
a sanctuary is doing right and wrong, as well 
as an overall score that the public can see. 

Ross hopes his tool, a pilot project funded 
by the Arcus Foundation, will give sanctu- 
aries more scientific guidance and help the 
system keep its trust with the outside world, 
including the donors critical to its survival. 
If widely adopted, he thinks his approach 
could lead to better lives for chimps in fa- 
cilities throughout the world. But all of that 
depends on how the tool performs today at 
Project Chimps. 


A STAFFER DRIVES UP and opens the gate. 
Ross, 50, with salt-and-pepper stubble, con- 
tinues down the narrow road until it ends 
at a tan, three-story building known as the 
Chateau. At 650 square meters, this is the 
largest structure on campus and home to 
two groups of 14 chimpanzees, more than 
one-third of the 78 animals that live at Proj- 
ect Chimps. The Chateau and four other 
chimp “villas” punctuate a tall concrete 
wall topped with electric wire that encircles 
a forested, 2.5-hectare habitat. 
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Soon after we arrive, employees open the 
Chateau’s outer gates. One group of chimps, 
hooting and banging the walls in anticipa- 
tion, makes its way into the habitat. The 
gang includes celebrity pair Hercules and 
Leo, the oldest males in the group. For sev- 
eral years, they were the subject of an animal 
rights campaign to have them declared legal 
persons and released from their university 
lab. Today, the two have made new friends 
and “appear to enjoy life,’ says Project 
Chimps Executive Director Ali Crumpacker, 
who has joined us for the visit. 


Chimps on the move 

After invasive research on chimpanzees in the United 
States ended in 2015, sanctuaries began to take large 
numbers of retired chimps from biomedical facilities. 
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Ross, his pen perennially pressed to a 
stack of papers in his left hand, eyes the 
chimps from the roof as they expand into 
the hilly thicket of sourwood, pine, and wal- 
nut trees. Some scour for coconut shavings 
and pistachios the staff has scattered on 
the ground. Others sprawl on Sun-soaked 
climbing platforms. And others poke sticks 
into an artificial termite mound, pulling up 
peanut butter, applesauce, or whatever em- 
ployees have loaded it with today. “Subjec- 
tively speaking,” Ross says, “this is the best 
chimp yard in North America.” 

But it’s also the subject of controversy. 
The habitat has been Project Chimps’s main 
selling point, helping it secure millions of 
dollars in donations as well as the confi- 
dence of the New Iberia Research Center, 
which in 2016 pledged to send all 220 of 
its chimpanzees here. Yet a website run by 
former sanctuary employees claims the ani- 
mals spend “90% of their retirement in con- 
crete rooms,’ not much different from their 
housing at New Iberia. 

Former workers also allege that Project 
Chimps brought in more animals than it 
could handle, leading to vicious fights and 
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an uptick in stress behaviors like rocking 
and hair pulling. And they claim the sanc- 
tuary’s only veterinarian had no primate ex- 
perience and that buildings are in disrepair. 
“There doesn’t seem to be much reason to 
continue to uproot these chimps, only to of- 
fer care that’s worse than they had at the 
research facility,’ says Crystal Alba, who 
worked at Project Chimps for 3 years before 
she says she was fired for speaking up. 

The accusations, detailed in local news 
coverage and in National Geographic, have 
attracted the attention of animal rights 
groups. Many have criticized the sanctuary 
online. One group, in a campaign reminis- 
cent of those animal advocates typically 
stage against research labs, protested out- 
side the home of a board member of the 
Humane Society of the United States, the 
sanctuary’s biggest funder. Signs called him 
an “animal abuser” and featured photos of 
a chimp with a deeply split lip. 

Former employees have also leveled 
charges against Florida’s Save the Chimps, 
where 228 chimpanzees reside. A 3-month 
probe by a local news outlet detailed allega- 
tions including employee fights over how to 
care for the animals, which may have led to 
the death of one chimpanzee, and a short- 
staffed medical department that was slow 
to euthanize a dying chimp. The U.S. De- 
partment of Agriculture (USDA) has cited 
the sanctuary for 12 breaches of the Animal 
Welfare Act over the past 5 years, three for 
significant violations of medical care. 

One former caregiver at Chimp Haven, 
who spoke to Science on condition of ano- 
nymity because of a nondisclosure agree- 
ment, raises complaints against that facility 
as well. She contends that the sanctuary—the 
only one funded by the U.S. government, and 
the home of more than 300 chimpanzees— 
didn’t treat serious wounds and didn’t fix 
heating issues in older buildings. She also 
alleges that staff was spread so thin it lost 
track of a male chimp that drowned in the 
sanctuary’s moat. 

Sanctuaries have pushed back against 
the accusations. Crumpacker says stress 
behaviors like hair pulling can predate an 
animal’s time at Project Chimps. She says 
the sanctuary’s veterinarian routinely con- 
sulted with primate experts and that a 
newly hired vet has some chimp experience. 
And although Crumpacker admits the ani- 
mals here don’t go outside as much as she 
would like—she estimates 12 to 24 hours per 
week—she points out that the New Iberia 
chimps have no outdoor habitat at all. 

To increase outside access, she’d have to 
further divide the sanctuary’s habitat, al- 
ready split in two, so more groups could go 
outdoors at the same time. But that would 
deplete funds for bringing more chimpan- 
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zees to the sanctuary, she says. “We not only 
have to do the right thing for the animals 
that are here, but also for those that are not 
here yet.” 

A spokesperson for Save the Chimps says, 
“Chimpanzee care is an extremely compli- 
cated undertaking,’ but “we have both the 
expertise and the best interests of chimps in 
mind.” Chimp Haven President Rana Smith 
adds that her sanctuary has enough staff to 
closely monitor every chimpanzee, and that 
it’s constantly improving its infrastructure. 
“We pride ourselves on exceptional welfare.” 

Inspections over the past few 
years by USDA and the Global 
Federation of Animal Sanctuar- 
ies (GFAS) have found no ma- 
jor problems at either Project 
Chimps or Chimp Haven. And 
although USDA cited Save the 
Chimps, it has not levied fines. 
But many experts across the 
sanctuary world say such evalu- 
ations are often superficial, in- 
consistent, and not tailored to 
chimpanzees. That’s something 
Ross hopes to change today. 


WHILE THE CHATEAU chimps en- 
joy their outside time, Ross en- 
ters their enclosure. There are 
11 rooms, including two large 
playrooms and two porches with 
views of the habitat. “It’s like a 
house,” Crumpacker says. “You 
can decide which room you want 
to be in, and whether you want 
to hang out with other people.” 

Ross measures the size of 
each room; the floor space cov- 
ered in hay, blankets, and other 
substrates; and the number of 
perches and ropes the animals 
can climb. He also looks for 
steel panels that partially block 
the view of other rooms, allow- 
ing the primates to hide from 
their comrades if they’re feeling 
stressed. “All of these are things 
we know from the literature 
are important for chimp well- 
being,” he says. 

Word comes over a walkie-talkie that an- 
other group of chimpanzees is about to be 
let out on the other side of the habitat. So 
we hop in a golf cart and head over to the 
Chimps Ahoy Villa, home to a special group 
of apes. 

Project Chimps, like most zoos and sanc- 
tuaries, is always trying to increase the size 
of its chimpanzee groups to better replicate 
wild troops of more than 100 individu- 
als of mixed genders and ages. It’s a tricky 
business, aS you never know who’s going 
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to get along with whom. Just last week, 
staff merged nine males with 10 females at 
Chimps Ahoy, making the group one of the 
biggest in the sanctuary system. 

Ross says biomedical facilities sometimes 
separate chimpanzees by sex to avoid ba- 
bies. “It’s so unnatural for chimps,” he says. 
“When you have single-sex groups, you miss 
out on all the things chimps do, including 
mating.” (Males here get vasectomies, and 
the females are on birth control.) 

We watch the chimps through a window 
in the giant concrete wall. Soon, a 32-year- 





Chimpanzees browse an outdoor habitat at Florida’s Save the Chimps (top). Leo, 
pictured at Project Chimps (bottom), was the subject of an animal rights campaign 
to have him declared a legal person. 


old chimp with a graying muzzle named 
Justin ambles by. He has a deep gash in his 
back, looking a bit like a velociraptor took 
a swipe at him. It’s the kind of untreated 
wound that might show up in a whistle- 
blower complaint. 

Ross is unfazed, however. He says Justin is 
a low-ranking chimp, which means he gets 
beat up to make sure he knows his place— 
the same sort of hierarchical conflict that 
happens in the wild. “If the sanctuary took 
him out of his group and gave him vet care,” 
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Ross says, “they might beat him up worse.” 

Still, Ross doesn’t take wounds lightly. As 
he travels from villa to villa throughout the 
day, he spot checks every chimp, looking for 
injuries and stress behaviors that could be 
a sign of poor welfare or unstable groups. 

Everything Ross records feeds into his 
new tool, which produces three scores: 
“social,” assessing all chimps and the size, 
composition, and stability of their groups; 
“spaces,” based on the size and complexity of 
the animals’ environments; and “programs,” 
gauging vet care, staff experience, and diets. 
The sanctuary gets a full report, 
and the public sees a summary 
and final grade. 

Ross’s inspection is much 
more chimp-specific than that 
of USDA, whose 336-page 
guidelines mention the word 
“chimpanzee” only three times. 
The agency’s assessments also 
tend to be brief to the point 
of obscurity. A recent review 
of Project Chimps contained 
a single sentence: “No non- 
compliant items identified dur- 
ing this inspection.” 

In addition, Ross says, “USDA 
standards haven’t changed much 
over my lifetime,’ whereas his 
review is based on the latest re- 
search. For example, Crumpacker 
says according to the agency’s 
metrics, Project Chimps’s smaller 
villas could house up to 45 chim- 
panzees. “Steve says 15. It’s not 
just about how many chimps can 
occupy a space—it’s about how 
well they live there.” 

GFAS inspections are more 
thorough than those of USDA, 
but they’re not made public and 
not chimp-specific. Ross’s ap- 
proach is also more scientific, 
Crumpacker says. “Steve was 
asking about all of the [bed- 
ding] materials we use, and how 
much each chimp got, in inches,” 
she says. “GFAS might just say 
a group of chimps should get 
12 blankets a day. And USDA 
just cares that we wash them.” 

GFAS Executive Director Valerie Taylor 
agrees Ross’s tool adds value. “We would ab- 
solutely welcome something like that,’ she 
says. “There’s power in pooling resources.” 

Andrew Halloran, director of chimpanzee 
care at Save the Chimps, also embraces the 
tool. “Taking care of chimpanzees is a com- 
plicated science, and Steve’s approach treats 
it like one,” he says. His facility has recently 
adopted new guidelines with a more scien- 
tific approach to chimp welfare. Sanctuaries, 
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Project Chimps’s large, lush outdoor habitat has been 


Halloran says, are evolving from rescue or- 
ganizations into more sophisticated opera- 
tions that realize that an “evidence-based 
way of doing things is critical to providing 
good care.” 

Experienced biomedical facilities, mean- 
while, would likely have no use for Ross’s 
tool, says Steven Schapiro, who oversees 
primate welfare at the MD Anderson Cancer 
Center. He says operations like his, which 
cared for hundreds of chimps for decades, 
already have the knowledge they need. 

Still, Schapiro says he thinks Ross’s tool 
could help sanctuaries, which he believes 
“are 100% committed to the welfare of their 
chimpanzees.” The problem, he says, is that 
they lack access to the same knowledge and 
funding as the biomedical community. 

There’s no easy fix for those deficien- 
cies, Buckmaster argues. If Ross recom- 
mends sanctuaries hire more experienced 
staff, for example, she says, “I don’t know 
where they’re going to find them.” That’s 
why she and others in the biomedical 
community have argued that ex-research 
chimps should “retire in place” at research 
labs. That debate continues to play out as 
the National Institutes of Health considers 
which of its remaining 135 or so chimpan- 
zees will move to sanctuaries as planned. 
Last year, the agency announced that nearly 
four dozen chimps would stay at the Al- 
amogordo Primate Facility in New Mexico 
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one of its main selling points—and one of its biggest sources of controversy. 


because they were too old or sick for trans- 
port. It reiterated that position last month 
despite congressional pressure. 

Ross’s approach also raises an existential 
issue for sanctuaries: Is it even possible to 
give chimpanzees what they need in captiv- 
ity? That’s the concern of primatologist John 
Mitani, professor emeritus at the Univer- 
sity of Michigan, Ann Arbor, who has stud- 
ied chimpanzees in the wild for more than 
30 years. He notes that chimp groups in 
Africa regularly split apart and come back 
together, which they can’t do in captivity— 
and that even a large habitat like the one at 
Project Chimps can’t match the more than 
30-square-kilometer range of wild chimps. 
“Chimps like to come and go, just like peo- 
ple? Mitani says. Without that, he says, they 
can become anxious and violent—something 
even the best sanctuaries can’t prevent. 


AT THE END OF THE DAY, Ross returns to Proj- 
ect Chimps’s main office to talk about next 
steps with Crumpacker. Photos of more 
than 200 chimpanzees hang on the wall, 
with black-and-white pictures denoting 
animals still at New Iberia. The research 
center still plans to transfer most of its 
123 remaining apes here. “They have been 
quite diligent in their approach to the care of 
the chimps,” says Director Francois Villinger. 

Ross’s review, announced earlier this 
week, turns out to be mostly good news for 
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Project Chimps. The sanctuary earns a to- 
tal score of 81 out of 100, with high marks 
for its large group sizes and exceptional 
outdoor space. But the report also dings 
the sanctuary for the apes’ limited access 
to that space and for its relatively inexperi- 
enced veterinary and care staff. 

“Based on the allegations, I went in pretty 
skeptical,” Ross says. “But overall I think the 
chimps are well cared for.’ 

Crumpacker says Project Chimps will 
“try and make strides” toward addressing 
the recommendations—although she isn’t 
conceding every point. Ross recommends 
more scattering of food to better replicate 
natural foraging, for example, but she says 
hand feeding ensures that chimps get the 
right portions. 

For his part, Ross says he will continue to 
refine his tool as he visits more North Ameri- 
can chimpanzee sanctuaries, all of which 
have signed on. He hopes to expand the ap- 
proach to other types of chimp facilities, and 
to eventually help create similar tools for 
monkeys, horses, and other animals. 

Although Mitani doesn’t think it’s pos- 
sible to give chimpanzees an ideal life in 
captivity, he welcomes Ross’s attempt to im- 
prove conditions. “He’s trying to make the 
best of a bad situation, so more power to 
him,” he says. “Even if this tool just makes 
things a bit better, we owe these animals 
more than they have right now.” m 
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AAAS/SUBARU SB&F PRIZES FOR EXCELLENCE IN SCIENCE BOOKS 


Fodder for future scientists 


From a monarch mystery unlocked with the help of Indigenous people 
to an environmental disaster averted by a chemist who was not afraid 

of a fight, this year’s finalists for the Science Books and Films (SB&F) Prizes 
for Excellence in Science Books highlight how diverse perspectives, grit, 
and a healthy dose of hope are essential to the effective practice of science. 
Read on for reviews of the finalists written by the staff of the Science 
family of journals, with help from a few friends. —Valerie Thompson 


HANDS-ON SCIENCE BOOK 


Chemistry for Kids 


Reviewed by Pamela J. Hines! 


If you were to choose 25 discoveries to doc- 
ument the progress of chemistry through 
millennia, what would you pick? In Chem- 
istry for Kids, Liz Lee Heinecke takes us 
on such a journey, using familiar objects 
and simple scientific instruments to create 
straightforward chemistry experiments that 
chart the field’s evolution over time. 

Each chapter is centered on a different 
experiment and begins with a vivid illustra- 
tion that highlights a scientist and his or 
her work. A few paragraphs of engagingly 
written introduction are followed by color- 
ful photographs of youngsters demonstrat- 
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ing the steps of the experiment. A brief 
explanation of the chemistry that underlies 
each experiment wraps up each chapter. 
The author’s decision to include many 
women, even those not officially known as 
“scientists,” is most welcome. Marie Curie’s 
process of elemental precipitation, by which 
she extracted radium, is modeled (much 
more safely) with food coloring and rose- 
mary leaves. Meanwhile, the environmental 
spread of the insecticide DDT that Rachel 
Carson brought attention to is visualized by 
the diffusion of food coloring through gela- 
tin. Agnes Pockels, a pioneer of surface sci- 
ence, found inspiration in the bubbles that 
formed as she washed dishes. Using dish 
soap, milk, and food coloring, we can see 
for ourselves how surface tension drives the 
formation of colorfully abstract patterns. 
The topics covered range broadly. Some 
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chapters detail things we think of as part of 
everyday life (for example, soap or carbon- 
ated drinks). Other chapters touch on fron- 
tiers, including medicinal plant chemistry 
and crystallography. Older children could 
do many of these experiments on their own; 
younger children might need supervision. 


Chemistry for Kids: Homemade Science Experiments 
and Activities Inspired by Awesome Chemists, Past 
and Present, Liz Lee Heinecke, Quarry Books, 2020, 
128 pp. 


This Is a Book to 
Read with a Worm 


Reviewed by Caroline Ash? 


Charles Darwin’s final book was dedicated to 
the gradual way in which earthworms change 
the world by turning over soil (7). He was not 
the only early naturalist to appreciate this 
diminutive invertebrate. In the 18th century, 
Gilbert White acknowledged the “lamentable 
chasm” that would be left without the “small 
and despicable” worm to help make soil (2). 
This Is a Book to Read with a Worm is a 
colorful delight and an important little book 
that follows in White and Darwin’s footsteps, 
promising to help nurture a new generation 
of naturalists. It offers a mix of activities and 
facts about earthworms that a curious child 
could do independently (or, to be more pre- 
cise, with an earthworm). The book starts 
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with guidance for how to catch a worm and 
how to care for it in a simple homemade ter- 
rarium. Several suggested experiments reveal 
information about worms’ sensory responses 
and anatomy. For instance, by crafting a lis- 
tening tube made from paper, you can hear 
them crawling thanks to the scratching of 
their barely visible setae. 

Readers learn that in the United States, 
imported worm species are eating all the 
leaves that make up native earthworms’ diet, 
to the detriment of native species. This could 
have profound consequences, not only for life 
underground but also for the viability of the 
forests that rely on the leaf consumption and 
soil churning capacity of native worms. 





This Is a Book to Read with a Worm, 
Jodi Wheeler-Toppen, Illustrated by Margaret 
McCartney, Charlesbridge, 2020, 32 pp. 


REFERENCES AND NOTES 


1. C. Darwin, The Formation of Vegetable Mould Through 
the Action of Worms, with Observations on Their Habits 
(John Murray, 1881). 

2. G.White, The Natural History and Antiquities of Selborne 
(Benjamin White, 1789). 


You're Invited to 
a Moth Ball 


Reviewed by Seth Scanlon? 


Of all the insects that flitter and flutter into 
our homes, among the least welcome is the 
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common clothes moth (Tineola bisselliella), 
which has a penchant for ruining shirts, 
sweaters, and sundry other garments. How- 
ever, this perfidious pest is just one of more 
than 150,000 different moth species. In 
You’re Invited to a Moth Ball, Loree Griffin 
Burns invites budding entomologists to em- 
brace the “moth ball,” a nocturnal occasion 
to encounter the myriad moth species that 
live just outside our homes. 

The text, written in a warm and casual 
register, first describes how to employ 
a combination of fruit smeared on trees 
and specialized ultraviolet lights to attract 
moths to viewing stations made using light- 
colored sheets hung on clothes lines. The 
book then details what to expect of your 
lepidopteran guests and offers some ad- 
vice for maximally enjoying your nighttime 
rendezvous (although it fails to offer advice 
about how to handle other wildlife that 
might seek to gate-crash the ball). Along the 
way, Burns provides some useful, if basic, 
information about moth biology. 

The book’s vibrant and inviting photo- 
graphs, captured by Ellen Harasimowicz, 
are a true highlight. Indeed, the great va- 
riety and beauty of moths shown made me 
seriously consider gathering up the neces- 
sary supplies and having my own backyard 
moth party. However, the decision to omit 
captions, which might have explained the 
moths depicted in the photos, was a missed 
opportunity for further edification. I would 
have also liked to have seen more options for 
families who do not have access to backyards 
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with trees. Still, this is a fun and engaging 
book that encourages children to explore the 
wonders of the world around them and that 
should hopefully spur them to seek out more 
in-depth sources of information, helpfully 
signposted in the book’s “Resources” section. 





You’re Invited to a Moth Ball: A Nighttime Insect 
Celebration, Loree Griffin Burns, Photography by Ellen 
Harasimowicz, Charlesbridge, 2020, 40 pp. 


Creek Critters 


Reviewed by Trista Wagoner* 


Small streams are often overlooked and 
underappreciated, but they provide ample 
opportunities for curious kids to learn what 
it is like to be a scientist. In Creek Critters, a 
girl whose name we never learn invites her 
younger brother Lucas to explore a stream 
one might find in the mid-Atlantic region of 
the United States. The pair turn over rocks 
and scoop up rotting leaves in search of 
aquatic macroinvertebrates that ecologists 
use as indicators of a healthy ecosystem. 
Most of this short book focuses on insects, es- 
pecially in larval forms, describing their sizes 
and shapes along with appealing, detailed il- 
lustrations and photographs. The children’s 
goal is to find species, such as the mayfly and 
the caddis fly, that are sensitive to pollutants. 
“If we find them,’ the girl explains, “well 
know our creek is healthy.” 

The book ends with a few pages of activi- 
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ties, including a scavenger hunt and a tem- 
plate for recording observations about the 
environment. The last page lists noninsect 
biological indicators of stream health, such 
as snails, crayfish, and planaria. The pub- 
lisher’s website includes Spanish-language 
versions of the story (audio and text), and 
the Audubon Naturalist Society offers a 
“Creek Critters” companion app and addi- 
tional activities. 

I found myself wanting more from the 
book—more information about the impor- 
tance of healthy streams, more explanation 
of how pollutants affect flora and fauna, and 
more illustrations—but most of all, it left me 
with the feeling that I want more time to ex- 
plore my own nearby creeks. It may likewise 
provide inspiration and direction for chil- 
dren who enjoy getting their hands wet and 
their boots muddy. 





Creek Critters, Jennifer Keats Curtis with Stroud Water 
Research Center, Illustrated by Phyllis Saroff, Arbordale 
Publishing, 2020, 32 pp. 


CHILDREN’S SCIENCE PICTURE BOOK 


Mario and the Hole 
in the Sky 


Reviewed by Sacha Vignieri? 


During the 1970s, a time when many em- 
braced the motto “better living through 
chemistry,’ Mexican chemist Mario Molina 
questioned the safety of chlorofluorocarbons 
(CFCs) that had begun appearing in an in- 
creasing number of products, from refriger- 
ants to medical aerosols. He wondered about 
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the persistence of these compounds and 
their effects on the environment. When he 
realized that they were degrading the ozone 
layer, a key component of our atmosphere, 
he fought tirelessly against their use. In the 
end, he won the fight; laws were put in place 
around the world to limit the use of CFCs, 
but the path was not easy, and Molina and 
his colleagues faced vitriolic criticism. 

With gentle and engaging imagery, 
Mario and the Hole in the Sky conveys 
Molina’s story, rendering him first as a curi- 
ous child obsessed with his microscope and 
later as a worried scientist determined to 
convince a skeptical public of the dangers 
of CFCs. The book urges readers to follow 
their curiosity and to be prepared to fight 
hard for the health of the planet. “We saved 
our planet once. We can do it again,” Molina 
concludes at the end of the story, evoking an 
optimistic view of the world that is comfort- 
ing and full of potential. 

Molina, who passed away in October of 
this year (see Retrospective, p. 1170), worked 
tirelessly to educate the world about CFCs 
and overcame naysayers and critics to en- 
sure that they were phased out of use. Today, 
the threats we face are even more complex, 
the resistance to change stronger, and the 
global political environment even less con- 
ducive to collaboration. Nonetheless, the 
impact of solid science, collaboration, and 
inspiration is still powerful. Molina’s story, 
as presented in this lovely book, may help 
young readers see hope in similar efforts, 
and such hope will carry us forward in 
these challenging times. 


Mario and the Hole in the Sky: How a Chemist 
Saved Our Planet, Elizabeth Rusch, Illustrated 
by Teresa Martinez, Charlesbridge, 2019, 40 pp. 
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The Boy Who Dreamed 
of Infinity 


Reviewed by Kelly Servick® 


This coming-of-age tale is so vibrant and 
imaginative that a reader might forget that it 
is also the true story of Indian mathematician 
Srinivasa Ramanujan, who made substan- 
tial contributions to number theory and the 
study of infinite series. Author Amy Alznauer, 
a math professor at Northwestern University, 
takes readers to Ramanujan’s birthplace in 
South India at the turn of the 20th century. 
We meet Ramanujan first as a cherished 
infant prophesied to become a great thinker. 
But for most of this story, we see a more re- 
latable character: a bored student, irritat- 
ing teachers with his precociousness and 
sneaking off to be alone with his thoughts. 
When Ramanujan contemplates a mango— 
mentally cutting it into smaller and smaller 
pieces—he stares into infinity. In clouds, 
temple statues, and his mother’s folktales, 
Ramanujan sees numbers and mysteries. De- 
spite his mathematical brilliance, he fails out 
of college and is left destitute, despairing that 
his ideas will never reach the wider world. 
Alznauer’s story ends as Ramanujan accepts 
an invitation from mathematician G. H. 
Hardy to work and study in England—a deci- 
sion that will propel Ramanujan to greatness. 
By focusing mainly on his youth, The Boy 
Who Dreamed of Infinity offers only vague 
glimpses of Ramanujan’s ultimate contribu- 
tions to mathematics. But the book’s message 
transcends the world of academic research. 
This vivid story, which tumbles across the 
book’s pages in Daniel Miyares’s kinetic 


sciencemag.org SCIENCE 


ILLUSTRATION: PHYLLIS SAROFF FOR CREEK CRITTERS 


INSIGHTS | BOOKS 





ILLUSTRATION: © BY ERIC ROHMANN 


illustrations, celebrates the power of a self- 
assured and independent child. It could en- 
courage young readers to identify their own 
passions and acknowledge their own bril- 
liance before the adult world gets wise. 


The Boy Who Dreamed of Infinity: A Tale 


of the Genius Ramanujan, Amy Alznauer, 
lllustrated by Daniel Miyares, Candlewick, 2020, 48 pp. 


Winged Wonders 


Reviewed by Jeffrey Mervis’ 


It should not be hard to interest children in 
a story about monarch butterflies. But Meeg 
Pincus, the author of Winged Wonders, has a 
more ambitious goal: to teach them how sci- 
entists go about their work. 

Her story starts by introducing readers to 
Canadian zoologists Fred and Norah Urqu- 
hart, who began studying monarch migration 
patterns in the 1940s. Pincus then presents 
a cast of supporting characters, including 
citizens who answered ads asking people to 
tag the monarchs living in their neighbor- 
hoods, a couple who followed the monarchs 
throughout Mexico, and the villagers who 
celebrated the fall arrival of the monarchs, 
which roost in the oyamel tree groves high 
in the Sierra Madre. Large, lush drawings by 
illustrator Yas Imamura invite children to in- 
sert themselves into the story. 

Pincus does not stop at retelling the 
Urquharts’ story. Instead, she encourages 
readers to contribute to the ongoing re- 
search on monarchs and to work to protect 
their habitats. 

The book’s bare narrative is supple- 
mented by a final page filled with details 
about the Urquharts and those who have 
followed in their footsteps. However, those 
with additional questions—for example, 
why do monarchs migrate, and do they all 
travel such great distances?—will need to 
consult other sources. But that is a minor 
quibble if this mystery story kindles your 
child’s interest in this iconic species and 
environmental stewardship. 





Winged Wonders: Solving the Monarch Migration 
Mystery, Meeg Pincus, Illustrated by Yas Imamura, 
Sleeping Bear Press, 2020, 40 pp. 


Honeybee 


Reviewed by Jennifer Sills? 


In Honeybee: The Busy Life of Apis Mellifera, 
Candace Fleming uses rhythmic, meditative 
prose to describe the short life of a honey 
bee. The story begins as Apis “squirms, 
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pushes, chews” through her wax cell into 
the “teeming, trembling flurry” of the hive. 
She feeds larvae, tends to the queen, builds 
honeycomb cells, handles food, and guards 
the hive. Finally, in a climactic four-page 
spread, Apis takes flight. But her work is 
not done; before she stills, Apis will fly 500 
miles and visit 30,000 flowers. On her 35th 
day, as Apis’s story comes to an end, another 
honey bee emerges from her “solitary cell.” 

Vivid oil paintings illustrate Apis in me- 
ticulous detail, along with her fellow bees 
and the world she travels. The story gives 
readers an appreciation of the many hid- 
den tasks that take place inside a beehive, 
as well as the cooperative structure of a bee 
colony, where each individual contributes 
to the group in a variety of ways. 

At the end of the book, an appendix pro- 
vides information about honey bees’ crucial 
role in human food production, the threat 
of colony collapse disorder, and what read- 





ers can do to help. A final section adds in- 
formation about the other occupants of the 
hive—the queen and drones—and more 
details about the intriguing dance perfor- 
mances that honey bees use to communi- 
cate the location of flowers to one another. 





Honeybee: The Busy Life of Apis Mellifera, 
Candace Fleming, Illustrated by Eric Rohmann, 
Neal Porter Books, 2020, 40 pp. 


MIDDLE GRADES SCIENCE BOOK 


Can You Hear 
the Trees Talking? 


Reviewed by Marc S. Lavine? 


We often think of trees when the leaves 
change color in the fall or when devastating 
forest fires, disease, or invasive species lead 
to widescale loss. Few of us take the time to 
consider, however, the many parallels that ex- 
ist between trees and more mobile creatures. 
Trees can form communities, in which elders 
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nurture and educate the young. Collectively, 
they can moderate the local temperature 
and humidity, store water, and protect each 
other. Isolated trees may mature faster than 
trees surrounded by others of the same spe- 
cies, but this accelerated growth can harm 
the lone tree’s longevity. In Can You Hear the 
Trees Talking?, Peter Wohlleben provides a 
fascinating and detailed guide to these less 
appreciated aspects of tree life. 

Trees use senses akin to taste and smell to 
detect and deter predators and alert neigh- 
boring trees. However, much of the com- 
munication in a forest occurs belowground. 
Neighboring tree roots of the same species 
share nutrients and pass along messages 
about local threats. Fungi that live off dead 
trees or leaves send out a network of fine 
threads that act like a highway for chemi- 
cal messages. But this relay service comes 
at a cost: Fungi can demand up to a third 
of the sugar that a tree produces, and they 
will sometimes share that sugar with trees of 
other species to aid their own preservation. 

Wohlleben wrote this book as a young 
reader’s edition of The Hidden Life of Trees 
and has done a remarkable job ensuring that 
each page tells a complete story, answering 
questions ranging from how trees know when 
it is spring to why trees are important in the 
city. Activities and quizzes appear throughout 
the book to engage readers of all ages. 


Can You Hear the Trees Talking? Discovering the 
Hidden Life of the Forest, Peter Wohlleben, Translated 
by Shelley Tanaka, Greystone Kids, 2019, 84 pp. 


Eclipse Chaser 


Reviewed by Adrian Cho”? 


On 21 August 2017, the Moon glided between 
Earth and the Sun, creating an eclipse visible 
across the United States. Millions of people 
flocked to the path through 13 states along 
which the Moon would completely cover the 
disk of the Sun and create a total eclipse— 
among them were Shadia Habbal, a solar 
physicist, and her team. In Eclipse Chaser, 
science reporter Ilima Loomis recounts their 
exciting yet sometimes anxious expedition to 
study the solar corona, the Sun’s atmosphere, 
which is visible only during a total eclipse. 

Whereas many solar physicists rely 
on huge purpose-built telescopes on the 
ground and in space to study our at times 
capricious little star, Habbal relies instead 
on eclipses and portable telescopes to probe 
the still poorly understood corona, trekking 
to remote peaks and deserts, where bad 
weather or a sudden sandstorm can spoil 
months of preparation. 

Loomis deftly explains the mechanics of 
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Physicist Shadia Habbal 
and her team study the solar 
corona in Eclipse Chaser. 


an eclipse and the questions Habbal is try- 
ing to answer. She conveys the homey feel 
of the expedition, on which Habbal’s sister 
treats the team to a homemade chicken 
dinner. In a gratifying nuance, Loomis 
notes that, for years, Habbal seldom looked 
at the corona herself, as she was always too 
busy tending to her instruments and cam- 
eras. Only now that the systems run auto- 
matically can she turn her eyes skyward. 

Eclipse Chaser teems with intriguing 
photos snapped by Amanda Cowan. Visu- 
ally, the Sun is the real star of the show. 
The eerie photos of its gossamer corona 
stretching out across the darkened sky may 
make the reader long to see it in person. If 
he or she is so inclined, the book includes 
a handy map that shows the paths of total 
eclipses through 2060. 








tive with finely drawn central characters. 

Animal-mad children will love both the 
story and the abundant photography. Close- 
up shots of Yola and her family offer insight 
into the emotional bond and life of the go- 
rillas. Sidebars that explain gorilla biology, 
society, and behavior serve as either useful 
breathers or mildly jarring distractions, de- 
pending on the reader’s attention span. 

This is a book that packs a hefty educa- 
tional punch in a delightful package. Readers 
will finish with an understanding of the close 
evolutionary relationship between humans 
and other primates and the precarious con- 
servation status of gorillas. Most importantly, 
as they watch Yola flourish by “growing up 
gorilla,” they will learn that the best way to 
love and respect nonhuman animals is by 
keeping a little bit of distance. 





Eclipse Chaser: Science in the Moon’s Shadow, 
llima Loomis, Photography by Amanda Cowan, HMH 
Books for Young Readers, 2019, 80 pp. 


Growing Up Gorilla 


Reviewed by Cathleen O'Grady” 


Baby gorilla Yola faces a singular chal- 
lenge: Her mom, Nadiri, was raised by hu- 
mans and does not know how to parent. 
Growing Up Gorilla tells the story of the 
pair’s blossoming bond as the keepers at 
Seattles Woodland Park Zoo care for the 
newborn gorilla and gently encourage 
Nadiri’s maternal instincts. 

Clare Hodgson Meeker’s deft storytelling 
weaves together Yola’s first few months of 
life and Nadiri’s experience of being raised 
by humans. The zookeepers explain that 
they are eager to develop the bond be- 
tween mother and baby because Yola will 
best learn life skills that way. The result is 
a gentle but surprisingly gripping narra- 
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Growing Up Gorilla: How a Zoo Baby Brought Her 
Family Together, Clare Hodgson Meeker, Millbrook 
Press, 2019, 48 pp. 


Condor Comeback 


Reviewed by Laura M. Zahn”? 


In 1987, the last California condor in the wild 
was taken into captivity. Along with 21 oth- 
ers of its kind—together representing the last 
of the species—the bird was used to launch 
a captive breeding program. The eventual 
goal of the program was to return condors 
to the wilds of the western United States. 
Part of the Scientists in the Field series, 
this book tells the condor’s comeback story 
through the work of conservation scientist 
Estelle Sandhaus, director of conservation 
and science at the Santa Barbara Zoo, and 
her collaborators. The book details Sand- 
haus’s efforts to identify how human inter- 
vention can ensure that individual birds 
thrive outside of captivity, thereby ensur- 
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ing that the wild population will grow. 

Condors still face multiple threats, includ- 
ing potentially serious health risks associated 
with eating “microtrash” (for example, small 
pieces of metal, shards of glass, or plastic 
packaging) and carcasses containing lead 
shot. The book takes readers along on a health 
assessment of wild birds and highlights the 
many ways in which they are monitored, pro- 
viding insight into the job of a conservation 
scientist. Monitoring, together with captive 
breeding and reintroduction programs, com- 
munity outreach, and policies banning lead 
shot, has resulted in successful reproduction 
and reintroduction into the wild. 

Today, there are more than 450 California 
condors, about half of which are living in the 
wild. However, more conservation scientists 
are needed to ensure the species’ continued 
success. Hopefully this book will encourage 
children to consider entering this field. 


Condor Comeback, Sy Montgomery, Photography 
by Tianne Strombeck, HMH Books for Young Readers, 
2020, 96 pp. 


YOUNG ADULT SCIENCE BOOK 


The Last Stargazers 


Reviewed by Keith T. Smith” 


When a young person imagines an astrono- 
mer, they likely picture a white-coated indi- 
vidual who spends each night peering at the 
sky through a small refracting telescope on a 
roof. But, as Emily Levesque recounts in The 
Last Stargazers, the reality of being a profes- 
sional astronomer is very different. 

Observational astronomers must apply 
months in advance for competitive time on a 
limited number of giant telescopes in remote 
locations, perhaps securing just a few nights 
per year. After traveling to the observatory, 
they must work through the night from a 
control room, trying to collect enough pho- 
tons to produce a result. The weather is a 
constant threat—if it is unfavorable, weeks of 
work and travel could result in no data at all. 
Levesque successfully captures the strange 
mix of excitement, impatience, camaraderie, 
and sleep deprivation that I recognized from 
my own observing trips and relates numer- 
ous anecdotes about uncooperative wildlife, 
malfunctioning equipment, natural disasters, 
and idiosyncratic observatory staff. 

The book is partly autobiographical— 
Levesque is a professional astronomer who 
studies massive stars—weaving the author’s 
experiences with highlights from interviews. 
Historical tales of famous observers, such 
as Edwin Hubble and Vera Rubin, include a 
discussion of how many female astronomers 
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had to overcome gender bias. Levesque’s em- 
phasis is on optical astronomy, although she 
does mention radio telescopes and the new 
methods of gravitational wave astronomy, 
with a disappointingly heavy focus on Ameri- 
can astronomers and U.S. facilities. 

The book’s final chapters cover recent 
shifts toward remote-controlled, queue- 
scheduled, or fully automated observing. 
While such methods are cheaper, more ef- 
ficient, and more environmentally friendly, 
they do diminish the drama and wonder of 
professional astronomical observing. The 
Last Stargagers may ultimately become a his- 
torical record of a dying way of doing science. 


The Last Stargazers: The Enduring Story 
of Astronomy’s Vanishing Explorers, 
Emily Levesque, Sourcebooks, 2020, 336 pp. 


Borrowing Life 


Reviewed by Yevgeniya Nusinovich”* 


Borrowing Life by Shelley Fraser Mickle tells 
the story of the pioneering 20th-century 
surgeons, scientists, and patients who made 
organ transplantation a reality. Although the 
book’s primary focus is on kidney transplan- 
tation, Charles Woods—a young pilot who, 
in 1944, was severely burned in an airplane 
accident and unexpectedly survived with the 
aid of donor skin grafts—takes center stage in 
the first half of the story. 

Joe Murray, the surgeon who treated 
Charles, and one of the central figures in 
Mickle’s book, would go on to join a team of 
talented physicians under the supervision of 
surgeon Francis Moore, where he drew on 
immunologist Peter Medawar’s animal stud- 
ies to successfully execute the first human 
kidney transplant in identical twins in 1954. 
Mickle recounts how the subsequent applica- 
tion of the team’s technique using unrelated 
donors gradually became feasible with care- 
ful calibration of immunosuppression. 

Although Mickle met both Murray and 
Moore decades before writing this book, Bor- 
rowing Life draws primarily from archival 
documents, including memoirs written by 
the participants and contemporaneous news 
coverage. It is written in an informal style, as 
a series of disparate anecdotes that gradually 
converge on a single story. 

Mickle makes an unusual narrative deci- 
sion, framing the medical tale with the love 


stories of the key participants (Woods, Mur- 
ray, Moore, and Medawar) and their wives. 
Her attempt to recognize the efforts and per- 
sonal costs incurred by women with history- 
making husbands is appreciated, as many 
biographies take such sacrifices for granted. 
Unfortunately, the execution of this strategy 
often falls flat because Mickle offers few de- 
tails about the wives’ actual contributions. 
Nevertheless, the book presents a set of com- 
pelling stories that may be eye-opening for 
readers who have grown up in a time when 
organ transplantation is taken for granted. 


Borrowing Life: How Scientists, Surgeons, and a War 
Hero Made the First Successful Organ Transplant 
a Reality, Shelley Fraser Mickle, Imagine, 2020, 288 pp. 


Wading Right In 


Reviewed by Andrew Sugden” 


Too often, wetland ecosystems are viewed 
as an unproductive obstacle to useful hu- 
man projects, and they have accordingly 
suffered—like so many other natural eco- 
systems—the depredations of development. 
Recognizing that the general public may “lag 
in their appreciation” of wetlands, Catherine 
Koning and Sharon Ashworth have set out 
on a mission to generate a wider interest in 
these fascinating and beautiful ecosystems, 
with a blend of science, natural history, and 
the personal stories of ecologists, wetland 
managers, conservationists, and more. 

Their focus is mainly the wetlands of the 
northeast United States—the plants and crea- 
tures that inhabit them and the people whose 
lives are enhanced by them. The book’s 
chapters are organized by wetland type— 
saltmarsh, forest wetlands, swamps, vernal 
pools, etc.—with final chapters on wetland 
restoration and on the connections between 
people and wetlands, both at the personal 
and community level. The book has a bibliog- 
raphy and plenty of further reading for read- 
ers wanting to dive in more deeply. 

The importance of wetlands cannot be 
overstated. They play a vital carbon seques- 
tration role in a changing climate. Coastal 
wetlands are also a protective bulwark 
between ocean and dry land and provide 
a haven for a huge range of resident and 
migratory wildlife. The authors’ efforts to 
highlight these benefits and to inspire a 
love of—or at least a tolerance for—mud, 


wind, and damp, flat horizons make this 
book an excellent and engaging introduc- 
tion to these ever-changing ecosystems. 


Wading Right In: Discovering the Nature of Wetlands, 
Catherine Owen Koning and Sharon M. Ashworth, 
Illustrated by Catherine Owen Koning, University 

of Chicago Press, 2019, 264 pp. 


The Alchemy of Us 


Reviewed by Mark Miodownik*® 


The questions of how much of our human- 
ity is due to material wealth and how our 
cultural values are baked into the materials 
we create are the subject of Ainissa Ramirez’s 
fascinating new treatise, The Alchemy of Us. 

Ramirez’s meditation on the materials that 
have facilitated community is particularly il- 
luminating. Here, she writes about the pho- 
nograph’s impact on how music was enjoyed. 
The ability to record music meant that the ex- 
perience of listening to it no longer had to be 
a communal one and spelled the end of much 
homemade folk music defined by materials 
such as brass, wind, or strings, but it also 
opened up uncharted horizons. The record- 
ings allowed a cross-fertilization of musical 
culture between jazz, blues, and rock and roll, 
even as the musicians themselves remained 
segregated by race politics. 

On the subject of race, Ramirez argues that 
a society that is racist will reflect racism in 
the substances that it makes. Because dark 
skin absorbs more light than white skin, for 
example, early photographs of black people 
were often barely recognizable. And although 
the cultural bias embedded in color film was 
corrected through chemical reformulations, 
it reemerged decades later in digital photog- 
raphy’s automatic facial recognition, which 
frequently fails to detect darker skin tones. 

The culture of innovation, Ramirez main- 
tains, does not belong only to privileged 
elites; it can be found in all those who care 
enough to reinvent the material world and, 
as a result, themselves. 





The Alchemy of Us: How Humans and Matter 
Transformed One Another, Ainissa Ramirez, MIT Press, 
2020, 328 pp. 


For a full-length review of The Alchemy of Us, see Science 
368, 41 (2020). 


10.1126/science.abf5636 





'The reviewer is a senior editor at Science. Email: phines@aaas.org *The reviewer is a senior editor at Science. Email: cash@science-int.co.uk *The reviewer is an associate editor at Science. Email: 
sscanlon@science-int.co.uk “The reviewer is a managing editor at the Science journals. Email: twagoner@aaas.org °The reviewer is a deputy editor at Science. Email: svignier@aaas.org °The reviewer 
is anews writer at Science. Email: kservick@aaas.org The reviewer is a senior correspondent at Science. Email: jmervis@aaas.org *The reviewer is the Letters editor at Science. Email: jsills@aaas. 
org °The reviewer is a senior editor at Science. Email: mlavine@aaas.org The reviewer is a news writer at Science. Email: acho@aaas.org "The reviewer is a news intern at Science. Email: cogrady@ 
science-int.co.uk “The reviewer is a senior editor at Science. Email: Izahn@aaas.org “The reviewer is a senior editor at Science. Email: ksmith@science-int.co.uk “The reviewer is a senior editor at 
Science. Email: ynusinov@aaas.org “The reviewer is a senior editorial fellow at Science. Email: asugden@science-int.co.uk The reviewer is at the Department of Mechanical Engineering, University 
College London, London WC1E 7JE, UK. Email: m.miodownik@ucl.ac.uk 


SCIENCE sciencemag.org 


Published by AAAS 


4 DECEMBER 2020 « VOL 370 ISSUE 6521 1161 


INSIGHTS 





PERSPECTIVES 


QUANTUM GASES 


Quantum-limited sound attenuation 


Resonantly interacting atoms confined by lasers have implications for neutron stars 


By Thomas Schaefer 


rdinary sound is a harmonic oscil- 

lation in the density, temperature, 

and velocity of air. Sound intensity 

decreases because of the spreading 

of the sound wave, but ultimately, 

sound attenuation is due to the dif- 
fusion of momentum and energy from the 
crest to the trough of the wave. This effect 
can be characterized in terms of the dif- 
fusivity D of sound. In air, there 
is a very large separation of scales 
between the shortest scale, the dis- 
tance between molecules; an inter- 
mediate scale, the mean free path 
of air molecules, which controls the 
diffusivity; and the longest scale, 
the wavelength of the sound mode. 
On page 1222 of this issue, Patel et 
al. (1) study a very different and 
deeply quantum version of sound 
attenuation. The authors’ result il- 
luminates the transport properties 
of strongly correlated quantum 
fluids (2), with direct implications 
for the stability of spinning neu- 
tron stars (3). 

Patel et al. confined 2 million 
lithium atoms in a cylindrical box 
created using beams of laser light 
(see the figure). The box is about 
100 um long and 60 um in radius. 
A typical standing wave in the ex- 
periment has a wavelength that is 
only about 10 times larger than the 
mean distance between atoms. To 
observe sharp collective modes in 
this regime, the gas must be very 
strongly correlated. Making the gas 
very cold and tuning the interac- 
tion between atoms to a resonance 
achieves this correlation. The temperature 
of the gas is between 50 and 500 nK, which 
implies that the de Broglie wavelength of 
the atoms is equal to or larger than the 
mean atomic distance. The de Broglie 
wavelength is the wavelength of the quan- 
tum mechanical wave function of the at- 
oms. The interaction between the atoms 
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is tuned by means of a so-called Feshbach 
resonance (4). At resonance, we can think 
of the interaction as having zero range but 
infinite scattering length. This means that 
the wave function of two low-energy atoms 
is modified by interactions even if the at- 
oms are arbitrarily far apart. 

The resonant limit is referred to as the 
unitary Fermi gas, because the isotropic part 
of the scattering cross section is as large as 
the conservation of probability (unitarity) 


Observing quantum sound waves 
A cylindrical box made of laser beams, 100 um long and 60 um in 
radius, contains about 2 million ultracold lithium atoms. Changing 
the light intensity of the cylinder’s endcaps excites sound waves. 
The decay of the sound waves (diffusivity) is measured by tracking 
the frequency width of the standing waves. 


we 
D *—_——————— Endcaps 






| 


A Lithium 
y atoms 


in quantum mechanics allows it to be. The 
unitary Fermi gas is also scale invariant. This 
means that physical observables are fixed by 
dimensional analysis and universal functions 
of dimensionless ratios. We can apply this 
type of argument to the sound diffusivity. On 
dimensional grounds, D is proportional to 
A/m, where h is the reduced Plancks con- 
stant and m is the mass of the atoms. 

The constant of proportionality is deter- 
mined by the detailed mechanism for en- 
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ergy and momentum transfer. If momentum 
transfer is governed by the diffusion of at- 
oms, then D ~ pl,,,,/M, where p is the mean 
momentum of an atom and Las is the mean 
free path. In a classical gas, Dlg >> À, but in 
a strongly correlated gas, we expect the prod- 
uct of p and l to be limited by quantum 
uncertainty, so that D is of order A/m. 

The results of Patel et al. demonstrate this 
limiting behavior. In the experiment, sound 
modes are excited by shaking the endcaps 
of the cylindrical box. The position 
of resonances determines the speed 
of sound, and the width of the reso- 
nance determines the diffusivity. 
Patel et al. find that the diffusivity 
drops as the temperature is lowered, 
settling around D ~ 1.5 /m near 
the transition to a superfluid. This 
value is consistent with attempts 
to measure the shear viscosity and 
thermal conductivity of the unitary 
Fermi gas individually (5, 6), as well 
as with theoretical calculations (7). 
Below the critical temperature, the 
unitary gas forms a superfluid that 
is roughly analogous to Bardeen- 
Cooper-Schrieffer superconductiv- 
ity, but with a parametrically large 
pairing gap and critical temperature. 
Notably, no sharp features are found 
in the diffusivity at the phase-transi- 
tion temperature. 

The results of Patel et al. have 
direct implications for the struc- 
ture of spinning neutron stars. The 
matter in the outer layer, below the 
crust but outside the core, of a neu- 
tron star is a dilute liquid of neu- 
trons. The neutron-neutron scatter- 
ing length is much larger than the 
distance between neutrons, mak- 
ing observations of the unitary Fermi gas 
directly applicable, even though the tem- 
peratures and densities are many orders of 
magnitude larger in the star. The dimen- 
sionless ratios, such as the mean particle 
distance in units of the thermal de Broglie 
wavelength, being similar is what matters 
for modeling the stellar interior. 

Neutron stars have many possible modes 
of oscillations. A special class that arises ow- 
ing to the Coriolis force in rotating stars is 
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known as Rossby modes, also called r-modes. 
These r-modes are unstable, and they would 
lead to strong gravitational wave emission 
and a rapid spin-down of the star if not 
damped by momentum or energy diffusion. 
Understanding the diffusivity of neutron 
star matter is crucial to predicting the range 
of allowed spin frequencies and possible r- 
mode signals in gravitational wave detectors. 

More generally, Patel et al. illuminate the 
mechanism of transport in other strongly 
correlated quantum gases, such as the quark- 
gluon plasma investigated in heavy-ion colli- 
sions at the Relativistic Heavy Ion Collider 
and the Large Hadron Collider. The quark- 
gluon plasma is a state of matter that existed 
microseconds after the Big Bang, at a tem- 
perature T ~ 2 X 10” K. Measurements in- 
dicate that the momentum diffusivity of the 
quark-gluon plasma is quite low. In a relativ- 
istic setting, the mass of the particles is very 
small, and the natural scale for D is hc’/(k,T ), 


“ „Observations of the unitary 
Fermi gas [are] directly 
applicable [to the physics of 
neutron stars], even though 
the temperatures and densities 
are many orders of magnitude 
larger in the star.” 


where c is the speed of sound and k, is the 
Boltzmann constant. Experiments based on 
the hydrodynamic expansion of the plasma 
give values as small as D ~ 0.1 he’/(k,T). 
This number has been interpreted in terms 
of holographic models inspired by advances 
in string theory (8). However, in relativistic 
heavy-ion collisions, the precise mechanism 
of momentum transport is difficult to deter- 
mine. This problem can potentially be tack- 
led in future experiments with cold gases, for 
example, by carefully mapping the frequency 
dependence of the response of the gas to ex- 
ternal perturbations. 
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PLANT IMMUNOLOGY 


Enzyme formation by 
immune receptors 


Upon pathogen recognition, some plant immune receptors 
assemble into active enzyme complexes 


By Lei Tianand Xin Li 


igher plants have evolved complex im- 
mune systems. Intracellular immune 
receptors known as nucleotide-bind- 
ing leucine-rich repeat (NLR) proteins 
are present in both plants and ani- 
mals; they are essential for immune 
responses (1). Upon infection, NLRs can rec- 
ognize specific pathogen molecules and acti- 
vate defense. In contrast to animals, which 
have a limited NLR repertoire, higher plants 
usually harbor hundreds of diverse NLR 
genes. However, little is known about their 
activation and signaling mechanisms. On 
pages 1184 and 1185 of this issue, Ma et al. (2) 
and Martin et al. (3), respectively, reveal the 
structure and activation mechanism of two 
NLRs: Arabidopsis thaliana RECOGNITION 
OF PERONOSPORA PARASITICA 1 (RPP1) 
and Nicotiana benthamiana RECOGNITION 
OF XOPQ 1 (ROQ1). Both NLRs self-assemble 
in a similar manner into tetrameric holoen- 
zymes to activate defense responses upon 
direct effector recognition. 

There are two classes of typical plant NLRs 
for sensing pathogen effectors: Toll-interleu- 
kin-1 receptor (TIR)-type NLRs (TNLs) and 
coiled-coil (CC)-type NLRs (CNLs), which 
are defined by their different amino-termi- 
nal domains. As with mammalian NLRs, 
plant NLRs also contain a central nucleotide- 
binding domain (NBD) involved in oligomer- 
ization, as well as carboxyl-terminal leucine- 
rich repeats (LRRs) that often participate in 
auto-inhibition and ligand recognition (J). 
TNLs and CNLs are activated with dissimi- 
lar mechanisms and signal through different 
downstream components. 

Cryo-electron microscopy  (cryo-EM) 
structural analysis of the A. thaliana CNL 
HOPZ-ACTIVATED RESISTANCE 1 (ZARI) 
revealed that it assembles into a pentameric 
“resistosome” upon effector recognition, 
reminiscent of animal inflammasome rings 
that mediate innate immune responses (4). 
ZARI and an adaptor protein recognize the 
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pathogen effector indirectly through moni- 
toring the status of a host protein (called 
a decoy) that is directly targeted by the ef- 
fector. The pentameric ring assembles upon 
binding of the effector-modified decoy, lead- 
ing to the formation of a funnel-shaped 
structure composed of the amino-terminal 
CC domains of ZARI. It has been hypoth- 
esized that this funnel associates with the 
cell membrane and triggers immune-related 
cell death (5). Until now, the resistosome 
structures of full-length TNLs have not been 
elucidated. 

TIR-containing proteins are widely pres- 
ent in bacteria, archaea, mammals, and 
higher plants (6). In mammals, TIR is a sig- 
nature scaffold domain of immune recep- 
tors, including Toll-like receptors (TLRs) and 
interleukin-1 receptors (IL-1Rs), and of some 
downstream adaptor proteins. Interactions 
of TIR domains between receptor and adap- 
tor proteins are required for immune and 
inflammatory signal transduction. For ex- 
ample, TLR4 recruits the signaling adaptors 
MYD88 (myeloid differentiation primary 
response 88) and MAL (MYD88 adaptor- 
like) through TIR-TIR interactions, thereby 
activating downstream transcription fac- 
tors such as nuclear factor kB (NF-KB) to 
induce inflammation (7). By contrast, a large 
number of TIR domains found in bacteria, 
archaea, and higher plants seem to serve as 
oxidized nicotinamide adenine dinucleotide 
(NAD*) hydrolases (NADases) upon self- 
association (6, 8, 9). TIR NADase activity was 
discovered in mammalian SARMI (sterile 
alpha and TIR motif-containing protein 1), 
a major executor of neuronal axon degen- 
eration (JO). Like SARM1, TIR domains in a 
number of plant TNLs also exhibit NADase 
activity (8, 9). 

Cryo-EM analysis of full-length SARM1 
revealed that in its resting state, it assem- 
bles into an octamer. The carboxyl-terminal 
Armadillo/HEAT motif (ARM) domains 
block the contact between adjacent TIRs 
through binding to NAD* (77). With nicotin- 
amide mononucleotide activator elicitation, 
the SARMI octamer undergoes a confor- 
mational change, disrupting NAD* binding 
sites of the ARM domains to enable TIR- 
TIR dimerization (11, 12). Interactions be- 
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tween the TIR domains then activate their 
NADase function. The depletion of NAD* 
and the generation of products including 
nicotinamide, adenosine diphosphate ribose 
(ADPR), and cyclic ADPR (cADPR) together 
seem to serve as signals to trigger cell death. 
Because plant TNLs lack ARM domains, it 
was unclear whether plant TNLs form simi- 
lar octamers or adopt a parallel self-inhibi- 
tion mechanism. 

The A. thaliana TNL RPPI recognizes 
its cognate effector protein ARABIDOPSIS 
THALIANA RECOGNIZED 1 (ATRI1) from 
the oomycete pathogen Hyaloperonospora 
arabidopsidis. In nature, both RPPI and 
ATRI genes are highly polymorphic; 
different ATRIs are perceived by spe- 
cific RPP1 variants (13). By contrast, the 
N. benthamiana TNL ROQ1 monitors 
Xanthomonas bacterial infections through 
direct recognition of the conserved effec- 
tor XANTHOMONAS OUTER PROTEIN 
Q (XopQ) (14). RPP1 and ROQI both rec- 
ognize their respective effectors ATR1 and 





the C-terminal jelly-roll and Ig-like domain 
(C-JID). The C-JID and LRRs mediate ef- 
fector binding specificity. Mutations of 
residues in these domains can disrupt their 
interaction interfaces with the effectors, re- 
sulting in reduced host cell death responses. 

For both RPP1 and ROQI, direct bind- 
ing of the effectors to the LRRs and C-JID 
likely releases the NBDs, which can then 
undergo conformational change and oligo- 
merize into a tetramer. The tetrameric ring 
brings the TIR domains into close contact, 
allowing them to form the final active TNL 
resistosome with NADase holoenzyme activ- 
ity. Intriguingly, unlike in ZARI and ROQI 
resistosomes, where deoxyadenosine tri- 
phosphate (dATP) and ATP are bound to the 
activated NLRs, RPP1 is bound by adenosine 
diphosphate (ADP) in the tetrameric ring, 
challenging the paradigm that NLRs are ac- 
tivated through exchanging ADP with ATP. 
Future analysis of additional TNL resisto- 
somes may help to resolve the biological role 
of this observation. 





Formation of a major type of plant resistosome 
Direct effector binding drives the tetramerization of a plant TNL receptor, such as RPP1 and ROQI. Formation of the 
holoenzyme complex through their BB-loop enables NADase activity, activating downstream immune responses. 


Six subdomains 
of RPP1 and ROQ1 
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C-JID, C-terminal jelly-roll and Ig-like domain; HD1, helical domain 1; LRR, leucine-rich repeats; NADase, nicotinamide adenine dinucleotide 
hydrolase; NBD, nucleotide-binding domain; NLR, nucleotide-binding leucine-rich repeat; RPP1, RECOGNITION OF PERONOSPORA PARASITICA 1; 
ROQ1, RECOGNITION OF XOPQ 1; TNL, TIR-type NLR; TIR, Toll-interleukin-1 receptor; WHD, winged helix domain. 


XopQ through direct NLR-effector protein- 
protein interactions, which trigger strong 
immune responses to restrict pathogen 
colonization, including host cell death. 

Ma et al. and Martin et al. used cryo-EM 
to determine the structures of the puri- 
fied TNL-effector resistosome complexes. 
Despite the differences of the pairs, RPP1- 
ATRI and ROQ1-XopQ form highly similar 
tetrameric clover-like structures (see the 
figure), which suggests that there may be 
a common activation mechanism for TNLs. 
These resistosome structures provide key 
insight about how RPP1 and ROQI1 can di- 
rectly recognize their cognate effectors and 
tetramerize to promote NADase activity, 
thereby activating downstream immune 
responses. A structurally distinct domain 
was uncovered at the carboxyl termini of 
RPP1 and ROQI1, which is referred to as 
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In both the RPP1 and ROQI resistosomes, 
tetramerization of the NBDs brings the four 
TIR domains close together. They form a 
two-fold dimer of dimers instead of a four- 
fold symmetric tetramer. The asymmetric 
TIR dimer forms one predicted NAD* bind- 
ing site while the symmetric TIR dimers 
stabilize the complex. Upon activation, 
two dimers arise in a head-to-tail manner 
mediated by the BB-loop (see the figure). 
Subsequent TIR NADase activity induces 
host cell death. Mutations of amino acids 
within the BB-loop that only affected the 
head-to-tail interactions were found to alter 
RPP1 and ROQI NADase activity and their 
function in immunity. Therefore, tetramer- 
ization of TIR domains of TNLs to form a 
holoenzyme likely represents a common 
activation mechanism for this large class 
of plant NLRs. Instead of homotetramer- 
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ization as observed for ROQ1 and RPPI re- 
sistosomes, hetero-NLR pairs with atypical 
domains or carboxyl-terminally truncated 
NLRs with only the TIR or TIR-NBD do- 
mains may serve to diversify the activation 
possibilities of these immune receptors in 
higher plants through heterotetrameriza- 
tion (J). 

The RPP1 and ROQ1 structures reveal a 
new type of TNL resistosome distinct from 
that of the CNL ZARI. These findings repre- 
sent a major step forward in understanding 
the activation mechanisms of plant TNLs. 
However, it remains unclear how the TIR 
NADase activity activates downstream im- 
munity. In the two activated TNL resistosome 
structures reported by Ma et al. and Martin 
et al., no NAD* substrate was detected. One 
explanation could be that activated TIR do- 
mains quickly cleave NAD*. Alternatively, 
NAD* may not be the preferred or only in 
planta substrate, because the detected in vi- 
tro NADase activities of plant TNLs are of- 
ten weaker than those of bacterial TIRs and 
mammalian SARM1 (8, 9). Consequently, the 
complete spectrum of TNL TIR products 
should be investigated. Furthermore, the 
NADase activity of TNLs signals through key 
downstream components and helper NLRs 
(including the CNLs ADRI and NRGI) for 
immune signaling in a manner that is still 
enigmatic (15). Finally, both RPP1 and ROQI 
resistosomes comprise TNLs that recognize 
their effectors directly. In many other cases, 
such as with ZARI, other host proteins are 
required for effector recognition and likely 
for resistosome assembly. Therefore, addi- 
tional structures of resistosomes of different 
natures will provide better understanding of 
the diverse activation mechanisms of higher 
plant NLRs. 


REFERENCES AND NOTES 


. J.D. Jones et al., Science 354, aaf6395 (2016). 

. S.Maetal., Science 370, eabe3069 (2020). 

. R. Martin et al., Science 370, eabd9993 (2020). 

. J.Wang et al., Science 364, 44 (2019). 

. J.M. Zhou, Y. Zhang, Cell 181, 978 (2020). 

. K. Essuman et al., Curr. Biol. 28, 421 (2018). 

. T.Veetal., Nat. Struct. Mol. Biol. 24, 743 (2017). 

. S.Horsefield et al., Science 365, 793 (2019). 

. L.Wanetal., Science 365, 799 (2019). 

. K.Essumanetal., Neuron 93, 1334 (2017). 

. Y. Jiang et al., Nature 10.1038/s41586-020-2862-z 
(2020). 

. M.Bratkowskietal., Cell Rep. 32, 107999 (2020). 

. K.V.Krasileva et al., Plant Cell 22,2444 (2010). 

. A.Schultink et al., PlantJ.92, 787 (2017). 

L. M. Jubic et al., Curr. Opin. Plant Biol. 50, 82 (2019). 


pe 
m. O WOON DANA WMN e 


= =e e e 
o1 & W PO 


ACKNOWLEDGMENTS 


We apologize for original literature not cited because of 
space limitations. We thank J. Chai and K. Ao for discus- 
sions and careful reading of the manuscript. Supported 

by Natural Sciences and Engineering Research Council of 
Canada (NSERC)-Discovery and NSERC-CREATE-PRoTECT 
programs and by a scholarship from China Scholarship 
Council (L.T.). 


10.1126/science.abf2833 


sciencemag.org SCIENCE 


GRAPHIC: A. KITTERMAN/SCIENCE 





PHOTO: PHAN AND PARK (1) 


ENGINEERING 


Miniaturization of robots 
that fly on beetles’ wings 


The shock-absorbing wings of the rhinoceros 


beetle battle in-flight collisions 


By Jiyu Sun 


or centuries, humans have been fas- 

cinated by flight. Leonardo da Vinci 

(1452-1519) meshed his skills as an art- 

ist, biologist, and engineer to sketch 

designs for flying machines modeled 

after bird and bat anatomy. Today, 
multidisciplinary scientists work system- 
atically from investigating biological pro- 
totypes to conducting flight performance 
tests on new bionic robots. On page 1214 of 
this issue, Phan and Park (J) describe how 
they used biology, robotics, and a little bit 
of art to design a new miniaturized micro 
air vehicle (MAV) that is bioinspired 
by the rhinoceros beetle (Allomyrina 
dichotoma). Their MAV mimics the 
beetle’s hindwings, which have ori- 
gami-like folds that allow the insect 
to recover from flight collision. 

Insects and birds can both fly, but 
they have different mechanisms and 
anatomy. Birds use muscles in their 
wings to control flight movements. 
Insects control flight with muscles in 
their wing base or thoracic region and 
also with the wing’s compliant struc- 
tures, which include folding patterns, 
small hairs, veins, and the elastic pro- 
tein resilin. Because miniaturization of MAVs 
drives the research, scientists increasingly 
have turned to insects such as beetles for bio- 
inspired MAV blueprints. These robotic de- 
vices must be small, lightweight, and robust. 
They should also exhibit excellent wind resis- 
tance, a sustained energy supply, and rapid 
folding of wings. 

The anatomy of various organisms can 
inform MAV-deployable wing design. For 
example, four-dimensional-printed _ elas- 
tic wings mimic the wing of the earwig (2). 
Four-degrees-of-freedom deformable fold- 
ing wings (3) and passive unfolding wings 
(4) were bioinspired by bats. However, the 
sizes of these wings are larger than those 
of beetle-bioinspired wings. Beetles differ 
from other insects in that their forewings 
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A miniature 
robot soars with 
collision-controlling 
deployable wings. 


(elytra) are hardened and encased, whereas 
their hindwings—which they use to fly—are 
deployable, a feature that directly enables a 
reduction in body size. Despite their small 
size, beetles exhibit an impressive flight abil- 
ity. For Anoplophora glabripennis beetles, 
the maximum distances traveled in a season 
are reported to be 2644 m for females and 
2394 m for males (5). The largest flight dis- 
tance recorded over the life span of an adult 
Asian long-horned beetle was more than 14 
km (6), and their maximum observed flight 
speed was 5.3 m/s (7). 

Previous beetle-based miniature flying 
robots have made use of compliant origami 





structures inspired by the wing vein in lady- 
bird beetles to achieve gliding and jumping 
functions (8). A mini drone has also been 
built with deployable wings whose design 
was based on the origami-like mechanism 
of insect wings (9). 

Phan and Park report that, in the rhinoc- 
eros beetle, the origami-like folds in their 
hindwings provided a shock-absorbing 
function during in-flight collisions without 
completely folding. The authors used this 
mechanism to build a beetle-inspired, flap- 
ping robot that collides without folding its 
wings, thereby enabling stable flight recov- 
ery after collision. 

Using a high-speed multicamera system, 
Phan and Park observed that beetle wings 
unfold by aerodynamic forces produced by 
flapping and then locked in place to sustain 
flight. Through experiments with beetles fly- 
ing between narrow poles at different angles 
and positions, the authors defined two means 
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by which the insects withstand in-flight colli- 
sions. One response involves perching on an 
obstacle with its legs when it hits the inner, 
rigid segment of the hindwing. The other re- 
sponse is to continue flight if the obstacle hits 
the outer, folding segment of the wing, which 
passively folds and springs back into place 
when the obstacle is passed. 

Phan and Park validated the beetle design 
principle by fabricating a miniature folding 
wing and integrating it in a previously de- 
scribed, small, flapping-wing robot (JO). The 
new miniature MAV displayed both wing un- 
folding and resilience to an obstacle hitting 
the outer, folding segment of the wing. This 
robustness should spur interest in this wing 
mechanism for aerial robots. 

Challenges remain for the development 
of deployable MAVs. Deployment inevita- 
bly depends on the fatigue resistance of 
the MAV wings, which is directly related to 
the design and use of the MAV. In nature, 
beetles have resilin in their hindwings at 
positions that require repeated folding (7) 
or joints that can soften and fold during a 
collision to avoid damage (12). These fea- 
tures can inspire future designs of deploy- 
able MAV wings. For flapping MAVs, 
vibration generated by the drivers, 
airborne components, and flapping 
actions can create uncontrolled take- 
offs and unstable free flight. Beetles 
can resolve these problems with flex- 
ible elastic biomaterials in their hind- 
wings and joints to absorb the energy 
caused by vibration. Thus, researchers 
might consider incorporating these 
features into MAVs with deployable 
flapping wings to increase the overall 
wing strength and flight stability. 

Future research should also focus on 
resolving the unsteady aerodynamics 
problem for MAVs with low Reynolds num- 
bers (the ratio of inertial forces to viscous 
forces). Scientists could then tackle the man- 
ufacturing technology for miniaturization of 
MAV components (such as airfoil and fuse- 
lage) and for flight control under unsteady 
aerodynamics conditions (13). 


REFERENCES AND NOTES 


. H.V.Phan, H.C. Park, Science 370, 1214 (2020). 

. J.Deiters, W. Kowalczyk, T. Seidl, Biol. Open 5,638 (2016). 

. A.Ramezanietal., Sci. Robot. 2,eaal2505 (2017) 

A.K. Stowers, D. Lentink, Bioinspir. Biomim.10, 025001 

(2015). 

. M.T. Smith et al., Environ. Entomol. 33, 435 (2004). 

. M.Javalet al., J. Appl. Entomol. 142, 282 (2017). 

. D.W. Williams et al., Environ. Entomol. 33,644 (2004). 

. S.M. Baeketal., Sci. Robot. 5,eaaz6262 (2020). 

. L. Dufour, K. Owen, S. Mintchev, D. Floreano, 2016 IEEE/ 
RSJ International Conference on Intelligent Robots and 
Systems (IEEE, 2016), pp. 1576-1581. 

10. H.V. Phan etal., IEEE Robot. Autom. Lett. 5,5059 (2020). 

ll. F.Haasetal.,Proc. R. Soc. London Ser. B 267, 1375 (2000). 

12. S.Mintchevetal.,/EEE Robot. Autom. Lett.2,1248 (2017). 

13. J.Sunetal.,J. Mech. Behav. Biomed. Mater.94, 63 (2019). 


10.1126/science.abf1925 


FPwNMr 


WO o ~nu oD O1 


4 DECEMBER 2020 « VOL 370 ISSUE 6521 1165 


INSIGHTS | PERSPECTIVES 





SEMICONDUCTOR PHYSICS 


Probing the dark side of the exciton 


Photoemitted electrons reveal large-momentum (“dark”) excitons in monolayer WSe, 


By Meng Xing Na! and Ziliang Ye! 


wo-dimensional (2D) semiconduc- 
tors, such as transition-metal dichal- 
cogenides, may enable new optoelec- 
tronic technologies (7). The optical 
excitation in these atomically thin 
materials creates tightly bound exci- 
tons composed of an excited electron and a 
valence-state hole (2), as well as a plethora 
of exciton complexes due to the reduced 
screening in Coulomb attraction (3-5). So 
far, excitons with large momenta have not 
been directly probed because photons only 
carry very small momenta and cannot di- 
rectly interact with large-momentum exci- 
tons, but these dark excitons are predicted 
to exist in certain 2D semiconductors (6, 7). 
On page 1199 of this issue, Madéo et al. (8) 
used time- and angle-resolved photoemis- 
sion spectroscopy (TR-ARPES) to directly 
probe dark excitons in monolayer tungsten 
diselenide (WSe,). By tracking the dynam- 
ics of electrons that constitute both bright 
and dark excitons, the authors reveal how 
both are formed and show that the latter 
outnumber the former at steady state. 
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There are two families of excitons in 2D 
semiconductors, with electrons at different 
valley-like local minima of the conduction 
band. Each valley is named after its location 
in the Brillouin zone (momentum-space). 
Bright excitons consist of electrons and 
holes both located in K valleys that form 
through the direct transition of the electron 
to an excited state with the same momen- 
tum (see the first figure). As the momentum 
of the exciton is defined by the momentum 


“These results...constitute 
a direct probe of dark-exciton 
formation and its binding 


energy and dynamics... 


difference between its constituting electron 
and hole, these so-called K-K excitons have 
zero momentum and can directly interact 
with light. Dark Q-K excitons are composed 
of an electron excited indirectly into a Q 
valley with a different momentum than that 
of the hole left in a K valley. Valleys with 
opposite momenta are equally populated in 
this study. 

Madéo et al. performed their TR-ARPES 
experiment by combining a photoemis- 


sion electron microscope (PEEM) with 
an extreme ultraviolet (XUV) light source 
(see the second figure). They first shine 
an ultrafast visible-light pulse at the WSe, 
monolayer. By tuning the wavelength of this 
pulse, they can excite either excitons or free 
carriers. After a variable delay time, an ul- 
trafast XUV pulse is shone onto the sample. 
The high photon energy of the XUV light 
then ejects electrons from the monolayer. 

With no surface state present, the authors 
directly accessed the electrons’ energy and 
momentum inside the material by analyz- 
ing the kinetic energy and emission angle 
of the ejected electron (measured here by 
an angle-resolved time-of-flight analyzer). 
Both the visible and XUV light are ultrafast 
pulses. The first visible pulse excites the 
system, and the second XUV pulse acts ef- 
fectively as a camera shutter. By fine-tuning 
the delay between the excitation and the 
shutter, the authors obtain a “movie” of 
electrons in momentum space in which free 
carriers, bright excitons, and dark excitons 
all have distinct signatures. 

The authors find that both bright and 
dark excitons exist in the monolayer WSe.,,. 
Notably, the bright K-K excitons prefer- 
entially convert into dark Q-K excitons 
through phonon scattering and lead to a 
density of dark excitons double that of the 





Bright and dark excitons 

Bright excitons have electrons in the K valley 
conduction bands and have zero momentum. Dark 
excitons have electrons in the Q valley conduction 
bands and carry momentum. 


K valley 


Q valley 
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_ Movies of dark-exciton formation 


Dark excitons cannot be probed directly with photons. Madéo et a/. used photoemission 







1 Excitation pulse ——e 








2 Exciton — 3 
formation C 


3 Extreme 
ultraviolet 
pulse 


Pump and probe 


: microscopy to reveal them in monolayer tungsten diselenide (WSe,). 


Photoemission microscopy 
Analysis of the resulting photoemission 
microscopy images gives the relative 
populations of the K and Q valleys and 
the dynamics of dark and bright excitons. 





A visible pump pulse generates excitons in WSe,, and an extreme 


ultraviolet probe pulse ejects electrons. 
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bright excitons. The dark excitons also have 
a longer lifetime, so they become a reser- 
voir for the bright excitons at steady state. 
Tuning the excitation light off the exciton 
resonance excites free electrons and holes, 
and after about 500 fs, a quasi-equilibrium 
exciton ensemble forms. 

By comparing the photoemission signal 
from excitons with the free-electron energy 
at the band edge, Madéo et al. deduce the 
exciton binding energy for both bright and 
dark excitons. These results, which consti- 
tute a direct probe of dark-exciton forma- 
tion and its binding energy and dynamics, 
were made possible by a convergence of im- 
provements in the spatial resolution of the 
ARPES instrument as well as the high pho- 
ton energy of the ultrafast high-harmonic 
XUV source that could probe the entire 
Brillouin zone of WSe, (9). 

Although dark excitons do not inter- 
act directly with light, the electrons that 
constitute such dark excitons are acces- 
sible by photoemission (10, 11). Besides 
the Q-K exciton, other dark excitons in 2D 
semiconductors that are accessible by the 
technique of Madéo et al. include interval- 
ley excitons, spin-triplet excitons, Rydberg 
states with finite angular momentum, and 
higher-order exciton complexes (12-14). 
Another equally rewarding direction would 
be to study the wave function of a single 
exciton that was obscured in the current 
study by phonon scattering and other scat- 
tering channels (10, 11). The rapid prog- 
ress in van der Waals heterostructures also 
calls for more direct techniques to resolve 
their emerging electronic structures (15). It 
is expected that there will be many excit- 
ing opportunities to apply these powerful 
electronic probes in studying the excited 
states of quantum materials. 
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PLANT SCIENCE 


Constraints on selfish 
behavior in plants 


Plants overproduce roots to secure resources nearby 
but avoid costly trips to neighbors’ patches 


By Marina Semchenko! 


e are used to human behavior, 
and the actions of other animals, 
being described as selfish, aggres- 
sive, or cooperative. Such words 
come up less often when contem- 
plating plants. Yet plants too have 
evolved a fascinating array of behavioral 
strategies in their struggle for resources, 
although these are hard to demonstrate 
and quantify. Whether and when it pays for 
plants to selfishly overproduce roots and 
preempt resource capture by competitors, 
or cooperate by restraining root growth, has 
been the subject of extended debate (, 2). 
On page 1197 of this issue, Cabal et al. (3) 
tackle this question with a new theoretical 
model and an empirical test that highlight 
spatial costs of nutrient foraging as key to 
resolving apparent discrepancies between 
previous studies. 

Besides establishing which behaviors 
occur in the plant kingdom, determining 
the extent of root overproduction in the 
scramble for limited soil nutrients is vital 
for understanding global carbon cycling and 
finding solutions to major challenges that 
face humanity. Plants capture atmospheric 
carbon during photosynthesis, and a large 
fraction is channeled into root production, 
representing a vast capacity for carbon stor- 
age and a potential focus for approaches to 
mitigate climate change (4). Global models 
of primary productivity and carbon stor- 
age—and ecosystem responses to climate 
change—could be substantially modified by 
accounting for overinvestment into capture 
of shared resources (5). Moreover, under- 
standing the factors that enhance or sup- 
press competitive root proliferation could 
open new avenues in crop breeding and ag- 
ricultural practice (6). 

Rather than focusing on the total mass of 
roots produced by competing plants, Cabal 
et al. bring a spatial dimension to root for- 
aging by modeling the increasing cost of nu- 
trient transport from locations further away 
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from the plant stem. This cost constrains 
the extent of root foraging, leading to par- 
tial segregation of root systems between 
neighboring plants. However, the situa- 
tion is different in areas close to the plant 
stem: Low nutrient-transportation costs 
make excessive root production cheap and 
give a competitive advantage against neigh- 
bors’ roots that have traveled from afar; 
the model predicts that plant roots should 
avoid costly trips to neighboring territo- 
ries but invest in securing resources closer 
to home. Empirical observations appear to 
support the predictions of the model. Cabal 
et al. corroborated the model using pep- 
per plants, and an independent study that 
mapped the spatial distribution of roots in 
several grassland species showed that some 
plants shrank their foraging ranges when 
experiencing stronger competition but ag- 
gregated roots in the remaining core areas 
(7). Therefore, it seems that a successful 
strategy involves excessive root production 
in areas with low costs, but reduced over- 
lap with neighboring root systems (see the 
figure). The latter has previously been inter- 
preted as cooperative behavior or territorial 
defense (8) but seems to emerge as part of a 
selfish strategy restrained by nutrient trans- 
portation costs. 

By manipulating relevant model pa- 
rameters, Cabal et al. provide further in- 
sights: Excessive root production should be 
strongest in dense vegetation, where short 
distances between competing plants pre- 
clude exclusive access to resources, and in 
productive ecosystems or species with low 
root-construction costs. This suggests that 
agricultural practices that involve high fer- 
tilization rates and high planting densities 
are particularly conducive to competitive 
root proliferation at the expense of eco- 
nomically essential yield. Hence, optimal 
fertilizer use efficiency and improved yield 
might be achieved by adjusting crop spac- 
ing and fertilization practices in light of root 
spatial behavior and by breeding varieties 
that are less responsive to neighbor pres- 
ence and show more restrained growth of 
competitive organs (9). Such advances have 
already been achieved, sometimes unin- 
tentionally, aboveground by breeding crop 
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Mapping plant roots 


Roots (white dots) explore soil widely in the absence of neighbors (left-hand cirles; root density increases from 
blue to yellow). In the presence of neighbor roots (black dots in the right-hand circles), the plant in the top 
row keeps roots close to its stem, whereas the plant in the bottom row places them toward neighbors’ roots (7). 


Root mapping enables testing of plant behavioral theory. 





Root density 
High 








varieties with low stature (6) and reduced 
responsiveness to shading by neighbors (10). 
Conversely, overproliferation of roots could 
be enhanced in cover crops to maximize the 
positive effects on soil organic matter and 
associated benefits for soil fertility and dis- 
ease resistance (11). 

The dependence of root production on the 
spatial locations of interacting roots high- 
lights the importance of fine-scale spatial 
measurements in resolving ongoing debate 
about plant behavior. By measuring only to- 
tal root production or root production over 
large spatial scales, we may miss local over- 
and underproliferation and risk gathering 
data that reflect experimental artifacts of 
soil volume and nutrients (2, 12). However, it 
should also be recalled that many plant spe- 
cies have evolved in conditions of perpetual 
competition, seldom growing in the absence 
of neighbors. Some plants may therefore al- 
ways overproduce roots close to their stems, 
independent of neighbor presence. Verifying 
the existence of selfish overinvestment in 
root growth would be extremely challeng- 
ing in such instances. Just as we do not ex- 
pect trees to stop producing trunks in the 
absence of surrounding trees, we should not 
necessarily expect root overproduction to be 
curtailed in the absence of competition (13). 

Current evolutionary models of plant 
root production do not require that plants 
are able to detect competitors but are purely 
driven by resource dynamics, which none- 
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Alone 





With neighbors 





theless require the existence of complex sig- 
naling mechanisms. However, we know that 
some plant species are capable of detecting 
the proximity and even genetic identity of 
neighbors, which may trigger measures to 
secure resources before direct competition 
takes place (see the figure) (14). Plants may 
also assess the relative competitive rank of 
their neighbors and choose to avoid costly 
battles with opponents that they have no 
chance of winning (7, 14). This raises the 
prospect of greater diversity in plant forag- 
ing strategies and points to further avenues 
for exploration. 
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Stimulating 
the brain to 
restore vision 


High-definition brain 
prostheses are developed 
for treating blindness 


By Michael S. Beauchamp and Daniel Yoshor 


ore than 70 years ago, electrical 
stimulation of the human visual 
cortex was shown to elicit the per- 
ception of a brief flash of light, or 
phosphene (7). Subsequently, there 
were numerous attempts to develop 
cortical visual prostheses (CVPs) that electri- 
cally stimulate the visual cortex to restore 
vision to people with acquired blindness 
(2-4). The basic design of a CVP is simple: 
A head-mounted camera captures the vi- 
sual scene, and a computer translates it into 
patterned brain stimulation. However, CVP 
implementation foundered on technological 
limitations, especially the size and complex- 
ity of the stimulation hardware. Advances in 
miniaturization and the efficiency of digital 
circuits suggest that it is time to try again (5, 
6). On page 1191 of this issue, Chen et al. (7) 
describe the implantation of more than 1000 
electrodes in the visual cortex of nonhuman 
primates (NHPs) to create artificial vision. 

This technical tour de force relied on fea- 
tures of early visual cortex shared by humans 
and NHPs. The visual cortex takes up a sub- 
stantial fraction of the cerebral tissue, ~20% 
in humans. This creates a surface area of 
many square centimeters that can accommo- 
date the implantation of electrodes. The vi- 
sual cortex is retinotopic, meaning that there 
is an orderly mapping between each location 
in the visual scene and each location in the 
brain. A CVP with an array of electrodes 
can provide an array of phosphenes, similar 
to individual lights comprising a stadium 
scoreboard, that can be activated to produce 
visual sensations (percepts). 

In natural vision, information from the 
visual scene moves through a _ hierarchi- 
cal network of processing stages, from the 
retina to the thalamus to primary visual 
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cortex (V1) and higher visual areas, such as 
the fourth visual area (V4). Chen et al. im- 
planted electrode arrays in both V1 and V4 
of NHPs. For CVPs to function effectively, 
the current level for each electrode must be 
individually adjusted so that the current is 
sufficient to produce a detectable phosphene 
but not so high that the phosphene expands 
to cover an extended region of space. This 
requires time-consuming calibration in 
which the participant reports their percept 
at multiple different current levels for every 
electrode. Chen et al. address this problem 
by stimulating electrodes in V1 while record- 
ing from electrodes in V4. They show that 
it is possible to estimate the appropriate V1 
stimulation current from the recorded neu- 
ronal responses in V4, a process that could 
be conducted automatically and rapidly for 
multiple electrodes. 

For millions of patients with damaged 
or diseased eyes leading to blindness, there 
are few or no treatment options. Recently, 


the stimulation currents are 10- to 100-fold 
less than that required for electrodes that sit 
further away atop the cortex, as in the FDA- 
approved CVP. When hundreds of electrodes 
are stimulated at once, low currents are 
essential to minimize both the power con- 
sumption of the device and the amount of 
current injected into the brain. 

A number of technological and biological 
issues remain. On the technological front, 
the electrode arrays used by Chen et al. re- 
quire a wired connection between the brain 
and the rest of the CVP. A wireless device 
will be necessary for long-term implantation 
of a clinical device in humans. Fortunately, 
considerable advances in neural stimulation 
with biocompatible wireless devices mean 
that solutions are close at hand (8). 

Phosphenes are experienced as_ bright 
flashes, not the rich colors and forms that 
characterize natural vision. The reason for 
this difference is likely that neurons in V1 
respond to simple visual features, such as 


lated electrodes to convey information. For 
instance, the letter T could be conveyed as 
a horizontal stroke followed by a vertical 
stroke. Human patients implanted with 
small numbers of visual cortex electrodes 
were able to identify letter shapes delivered 
using a combination of current steering and 
dynamic stimulation (14). 

Future studies should also investigate the 
full realm of possible transformations be- 
tween the visual scene and patterned brain 
stimulation. Advanced machine vision can 
extract relevant information from the visual 
scene, which could change based on circum- 
stance. For example, in a navigation task, 
arrow shapes could by delivered to signal 
the correct heading direction (15). After de- 
cades of false starts, there is a bright future 
for CVPs. Chen et al. set a new benchmark 
for the next generation of CVPs by demon- 
strating that 1000 electrodes are sufficient 
to create percepts of letters, orientation, 
and motion. Advances in wireless stimula- 





Brain stimulation to create artificial vision 
Electrical stimulation of the visual cortex is used to create the perception of letters. Letter shapes were decomposed into dot patterns and shown on a computer display 
to train nonhuman primates (NHPs). The NHPs learned 16 different letters. (An example training pattern for the letter “A” is shown.) Then, between 8 and 15 visual cortex 
electrodes were stimulated to create artificial percepts. Simulated percepts for the letters “A” (left) and “L” (right) are shown. 
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six patients were implanted with a CVP in a 
U.S. Food and Drug Administration (FDA)- 
approved clinical trial. However, the device 
has only 60 electrodes, limiting patients to 
simple tasks such as detecting the light or 
dark areas in a visual scene. By contrast, 
the device developed by Chen et al. com- 
prises 16 arrays of 64 electrodes each, for a 
total of 1024 electrodes. The high electrode 
count meant that Chen et al. could arrange 
phosphenes in the shape of different let- 
ters, which the NHPs were trained to dis- 
criminate (see the figure). In addition, the 
NHPs were able to accurately perform sim- 
pler tasks, such as making eye movements 
to the location of a phosphene, determining 
whether two phosphenes were in a horizon- 
tal or vertical configuration, and deciding 
whether two phosphenes were stimulated 
in one order or another, creating the impres- 
sion of apparent motion. 

The electrodes used by Chen et al. pen- 
etrated into the cortex. Because intracortical 
electrodes are near the stimulated neurons, 
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oriented lines. Stimulating these neurons 
produces a correspondingly simple percept 
(9, 10). Neurons in higher-level visual areas 
respond to more complex features, and elec- 
trical stimulation of these areas can produce 
the experience of seeing colors (17) or faces 
(72). It is intriguing to speculate whether 
the NHPs in the Chen et al. study could be 
induced to see more naturalistic patterns if 
V4 and V1 were stimulated at the same time. 

Even with 1024 electrodes, the letter 
shapes that can be generated are crude (see 
the figure). New array technologies with 
orders-of-magnitude more electrodes will 
facilitate the generation of more refined 
shapes. Advanced stimulation algorithms, 
akin to software that accompanies the CVP 
hardware, can also be applied. With current 
steering, electricity is delivered to adjacent 
electrodes to stimulate tissue between the 
implanted electrodes, creating more phos- 
phene locations that fill in the retinotopic 
map (13). Another technique, called dynamic 
stimulation, uses the sequence of stimu- 
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tion, high-density electrode fabrication, and 
stimulation algorithms offer hope that new 
devices will provide useful visual function 
for people living with blindness. 
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Mario J. Molina (1943-2020) 


Visionary environmental chemist 


By Kimberly A. Prather 


ario J. Molina, an environmental 
chemist who dedicated his life to 
explaining and solving key societal 
challenges, died on 7 October. He 
was 77 years old. Molina showed 
how chlorofluorocarbons (CFCs) 
were destroying Earth’s protective ozone 
layer over Antarctica and worked to ban 
them. He also spearheaded efforts to im- 
prove air quality in Mexico, the United 
States, and Asia. He was one of the first to 
sound the alarm that continuing business 
as usual would likely lead to a climate ca- 
tastrophe. With his passing, the world has 
lost a tireless advocate for our environment. 

Born in Mexico City on 19 March 1943, 
Molina loved science at an early age, con- 
verting a bathroom in his family home into 
his own personal chemistry laboratory. He 
received his bachelor’s degree in chemical 
engineering at the National Autonomous 
University of Mexico in 1965, his master’s 
degree from the Albert Ludwig University 
of Freiburg, Germany, in 1967, and his Ph.D. 
in physical chemistry from the University 
of California, Berkeley, in 1972. In 1973, 
Molina became a postdoctoral fellow at the 
University of California, Irvine (UCI), in the 
lab of chemist F. Sherwood Rowland, who 
would become his lifelong friend and collabo- 
rator. Molina spent the next 30 years teaching 
and researching at UCI, the Jet Propulsion 
Laboratory, the Massachusetts Institute of 
Technology, and the University of California, 
San Diego (UCSD). In 2005, he established 
the Molina Center for Strategic Studies in 
Energy and the Environment in Mexico City. 

As part of their Nobel Prize-winning 
efforts on the environmental fate of the 
widely used refrigerants and propellants 
known as CFCs, Molina and Rowland were 
the first to propose that once CFCs made 
their way into the stratosphere, they could 
be broken down by solar radiation into 
chlorine atoms that could destroy the ozone 
layer. They published their predictions in 
1974, but their calls to stop all production of 
CFCs fell on deaf ears because there was no 
evidence yet of ozone degradation. By 1985, 
evidence of a hole in the ozone was build- 
ing. Molina published his laboratory find- 
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ings in 1987, showing that stable chlorine- 
containing gases could catalytically break 
down and lead to ozone loss, bolstering a 
key connection between lab studies and 
global observations of the ozone hole. 
Throughout his career, Molina advo- 
cated for scientists to apply scientific re- 
search directly to solve societal problems. 
With Rowland, he helped implement the 
Montreal Protocol, which is considered the 
most successful and effective environmen- 
tal global treaty ever negotiated and imple- 
mented. It provides an inspirational success 
story of how fundamental chemistry can 
be used to address global environmental 





problems. In 1995, Molina became the first 
Mexican-born scientist to receive a Nobel 
Prize in Chemistry, which he shared with 
F. Sherwood Rowland and Paul J. Crutzen 
for “contributing to our salvation from a po- 
tential global environmental catastrophe.” 

I met Molina in the late 1990s at a con- 
ference. He was gracious, soft-spoken, and 
generous from the moment I nervously 
introduced myself. Over the years, Molina 
became my mentor, scientific collaborator, 
colleague, and close friend. While working 
on climate and air quality policy issues at 
his center in Mexico City, he joined forces 
with scientists at UCSD on fundamental 
chemistry issues related to atmospheric 
chemistry and climate and helped establish 
the National Science Foundation Center 
for Aerosol Impacts on Chemistry of the 
Environment (CAICE), headquartered at 
UCSD. As director of CAICE, I was honored 
to have Molina as my science adviser, pro- 
viding guidance and support as our team 
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strove to reproduce realistic tropospheric 
aerosol surfaces representative of the ma- 
rine atmosphere. Molina often marveled at 
how his lab studies on chemistry occurring 
on stratospheric surfaces were far easier 
than our efforts to reproduce more chemi- 
cally complex tropospheric marine aerosol 
surfaces. This represents Molina’s charac- 
ter perfectly—always downplaying his own 
(Nobel Prize-winning) accomplishments. 

Molina dedicated substantial time to com- 
municating the urgent need to enact policies 
grounded in science. He encouraged scien- 
tists to avoid getting caught up in figuring 
out every final detail before working to im- 
plement fixes. He applied this philosophy to 
the ozone hole, air quality, climate change, 
and ultimately COVID-19. This year, he spent 
extensive time communicating how to pro- 
tect against airborne transmission of severe 
acute respiratory syndrome coronavirus 2 
(SARS-CoV-2). His efforts, and those of many 
of the scientists he trained and mentored, 
myself included, are now leading to greater 
public awareness of the critical need for 
wearing masks to save lives. 

Molina was an elected member of the 
National Academy of Sciences, the National 
Academy of Medicine, the Vatican’s 
Pontifical Academy of Sciences, El Colegio 
Nacional de México, the Mexican Academy 
of Sciences, and the Mexican Academy of 
Engineering. He also served on the U.S. 
President’s Council of Advisors on Science 
and Technology under former presidents 
Bill Clinton and Barack Obama. He has been 
recognized with 29 honorary degrees and 
many awards, including the Presidential 
Medal of Freedom in 2013. 

I first realized that I was not alone in con- 
sidering Molina a hero when we worked to- 
gether in Mexico City (where he introduced 
me to the very best of our shared favorite 
food, molé). Throngs of people surrounded 
him wherever we went. I had never seen 
a scientist treated like a rock star before, 
and I will always remember how gracious 
he was—smiling, answering questions, and 
taking pictures with his “fan club.” 

Molina’s inspirational combination of 
outstanding science and global impact has 
shaped my own career and the lives of many 
others. By setting an example, he showed the 
importance of communicating science to im- 
plement positive change. He loved teaching 
and interacting with students and served as 
a role model for generations of scientists. 
The world has lost an exceptional scientist 
and an even better human being in Mario 
Molina. I am sure that, like so many of us, 
Mother Earth has shed tears of her own over 
the loss of one of her greatest champions. 
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Fossil electricity retirement 
deadlines for a just transition 


A 2035 deadline for decarbonizing U.S. electricity would 
strand only about 15% of fossil capacity-years 


By Emily Grubert 


ecarbonizing the electricity sector 

is critical for addressing climate 

change, particularly given the ex- 

pected role of an expanded clean 

electricity system for home heating, 

transportation, and industry (J). This 
will require vast investment in new infra- 
structure such as renewable-energy power 
plants and batteries. Absent major invest- 
ment in carbon-capture equipment or fuel 
switching, it will also require the retirement 
of carbon-based power plants. Both moti- 
vate explicit attention to a “just transition” 
(2) that ensures material well-being and 
distributional justice for individuals and 
communities affected by a transition from 
fossil to nonfossil electricity systems (3). 
Determining which assets are “stranded,” or 
required to close earlier than expected ab- 
sent policy, is vital for managing compensa- 
tion for remaining debt and/or lost revenue 
(4, 5). Here, I introduce a generator-level 
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model to show that in the United States, a 
2035 electricity decarbonization deadline, 
as proposed by President-elect Biden and 
the 2020 Democratic party platform (6, 
7), would strand only about 15% of fossil 
capacity-years and 20% of job-years, which 
is unusually low from a global perspective 
[see supplementary materials (SM)] (4). 
Such insights into the location and timing 
of potential plant closures are critical for 
informing specific, coordinated, and locally 
grounded planning, which can substantially 
improve transition outcomes but is neither 
widespread nor supported by a national 
framework (8). 

In 2018, 10,435 fossil fuel-fired genera- 
tors produced 63% of U.S. electricity with 
841 GW of capacity. They also emitted 
1.9 billion tonnes (Gt) of carbon dioxide 
(GtCO.,), 1.3 Mt of nitrogen oxides (MtNO,), 
and 1.4 Mt of sulfur dioxide (MtSO,), while 
consuming 3.2 billion m? of water for plant 
operations and fuel extraction. These fa- 
cilities operated in 1248 of 3141 counties, 
directly employed about 157,000 people at 
generators and fuel-extraction facilities, 
and paid sometimes locally meaningful 
taxes (see SM) (8). 
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Advance planning can help communities adjust 
to fossil fuel-fired power plant closures, as 
with the 2019 shutdown of the Bruce Mansfield 
Power Plant in Shippingport, Pennsylvania. 


Retirements are already under way: 126 
GW of fossil generator capacity was retired 
between 2009 and 2018, including 33 GW 
in 2017 and 2018 alone (see data S1). But 
simply understanding that a closure is com- 
ing in some indeterminate future does not 
prevent economic shock when closures are 
announced with layoffs (9). We have seen 
before what happens without adequate 
planning and preparation, such as with 
the collapse of the U.S. steel industry in 
the 1970s and 1980s. Policy interventions 
for a just transition might include plan- 
ning, training, and funding to stabilize local 
conditions (8) or political efforts to address 
broad social costs of transition (3). Effective 
just transition planning is participatory and 
government supported, with emphasis on 
stabilizing revenue, context-specific consid- 
eration of existing strengths and needs, fos- 
tering a willingness to change, and ensuring 
environmental remediation (8). All of these 
are much easier to achieve if the location 
and timing of step changes such as plant 
closures are known, which is a key focus of 
the work described below. 

By establishing a deadline, a 2035 elec- 
tricity decarbonization target represents a 
major opportunity to facilitate a just transi- 
tion. Relevant locations are already known: 
The facilities that need to close exist. Policy 
can ensure that timing is also known—for 
example, through closure deadlines that are 
consistent with overall decarbonization tar- 
gets. This new model supports such steps 
and extends committed climate emissions 
work (10, 11) to inform just transition-ori- 
ented industrial policy by evaluating spa- 
tially and temporally explicit implications 
of explicit plant closure deadlines for cli- 
mate pollution, air pollution, water use, and 
plant and fuel extraction labor. 


THE 2035 CHALLENGE 

Understanding which generators would 
have completed their reasonably antici- 
pated life span before decarbonization 
deadlines can clarify where policy is strand- 
ing an asset (recognizing that financial 
liability depends on conditions such as 
ownership, regulatory setting, and depre- 
ciation status). To contextualize the impact 
of a 2035 electricity decarbonization target 
in the United States, this work assumes 
that all fossil fuel-fired electricity genera- 
tors that were operational as of 2018 main- 
tain their 2018 outputs until retiring at the 
capacity-weighted mean age on retirement 
observed for generators with the same pri- 
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Most capacity reaches the end of its life span by 2035 
U.S. fossil fuel—-fired power plant capacity by fuel and date generator reaches fuel- and technology-specific life span (left axis), and system characteristics (generation, 
water consumption, employment, CO, emissions, SO, emissions, and NO, emissions) as percentage of 2018 value (right axis), 2018-2070. Most capacity reaches the end 


of its life span by 2035, and all by 2066. Leftmost stacked bar shows capacity that has already reached the end of its life span as of 2018. 
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mary fuel and technology (prime mover) 
between 2002 and 2018 (see table S2). Ob- 
served mean age on retirement is an ag- 
gregate outcome variable that depends on 
a combination of physical, economic, policy, 
and other factors. This age is consistent 
across regions and time, at about 50 years 
for steam turbine-based generators and 
about 30 years for other generators (11). 
This work assumes that generators 
achieve a typical life span if they (i) retire at 
the end of their fuel- and prime mover-spe- 
cific typical life spans and (ii) maintain con- 
stant 2018 outputs through retirement (see 
SM). The goal is to illustrate conditions that 
are consistent with what interested parties 
might reasonably expect at the asset level. 
For example, investor compensation would 
not generally be considered necessary if a 
generator maintains historical outputs and 
retires at a typical age, and a host commu- 
nity or employee would not be unusually 
burdened by closure under such conditions. 
Given these assumptions, this work 
shows a plausible, generator-level future for 
fossil fuel-fired electricity generation in the 
United States, with details on how requir- 
ing each generator to close at the end of its 
fuel- and prime mover-specific life span 
(see table S2) would affect generation, CO, 
emissions, NO, emissions, SO, emissions, 
water consumption, and labor associated 
with both the plant and its fuel extraction 
(see the first figure, data S1, and SM). 
Under these conditions, committed com- 
bustion CO, emissions from existing U.S. 
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fossil electricity generators account for an 
estimated 25 Gt of CO, (see SM). Of operable 
U.S. fossil fuel-fired generation capacity (630 
out of 840 GW), 73% reaches the end of its 
typical life span by 2035 (810 GW, or 96%, by 
2050; 100% by 2066). About 13% of U.S. fos- 
sil fuel-fired generation capacity (110 GW) 
operating in 2018 had already exceeded its 
typical life span. The remaining 27% would 
need to close or convert earlier than a typi- 
cal life span to meet a 2035 decarbonization 
deadline (see the second figure, data S1, fig- 
ures S3 to S15, and movie S1). 

Because typical life spans are averages, 
some generators run longer. Simply allow- 
ing facilities to run until they retire is thus 
likely insufficient for a 2035 decarboniza- 
tion deadline. Closure deadlines that strand 
assets relative to reasonable life span ex- 
pectations, however, could create financial 
liability for debts and other costs that can 
no longer be paid because of policy action. 
A key finding of this research is that a 2035 
deadline for completely retiring fossil- 
based electricity generators would strand 
only about 15% (1700 GW-years) of fossil 
fuel-fired capacity life, alongside about 
20% (380,000 job-years) of direct power 
plant and fuel extraction jobs remaining as 
of 2018. This is unusually low from a global 
perspective (largely because U.S. infrastruc- 
ture is older than average) (4), limiting the 
scope of potential financial liabilities while 
enabling important, no-cost local benefits 
of closure deadlines, such as certainty re- 
garding timelines. 
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POLICY ACTION FOR A JUST TRANSITION 
U.S. policy to decarbonize the electricity sec- 
tor by 2035 can facilitate a just transition by 
establishing explicit retirement deadlines 
for fossil fuel-fired electricity generators. 
In the large majority of cases (73%), such 
deadlines could be at or later than the rea- 
sonably expected end of life for a given gen- 
erator and still comply with the 2035 target, 
allowing for years of advance planning 
grounded in the specific assets and needs 
of a community, enabling development of 
concrete and shared visions of the future 
(12). Facilities that would be partly stranded 
by a 2035 deadline would have more than 
a decade for transition planning if policy 
were enacted in the early- to mid-2020s. 
Advance planning is particularly important 
because utility-owned facilities that would 
be stranded by a 2035 deadline, leaving rate 
payers responsible for debts, are dispropor- 
tionately in states with higher poverty rates 
(see figure S16), possibly indicating a role 
for federal support. 

Even when financial aid is not expected, 
knowing when and where a facility closure 
will happen can enable targeted deploy- 
ment of training resources for people who 
need them, long-term budgeting that ac- 
counts for tax revenue losses, and advance 
planning for transitioning individuals to 
local jobs in environmental remediation 
of fossil facilities. Clear expectations about 
when and where generator retirements will 
occur can also facilitate synergistic behavior 
in support industries, such as by allowing 
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coal mines to operate at efficient scales to 
stockpile fuel to close out existing contracts. 

Closure deadlines could be implemented 
in multiple ways. One option is to require 
generators to close by fuel- and prime 
mover-based typical end of life (see table 
S2) or 2035, whichever is first. Under the 
assumptions described here, this approach 
results in cumulative emissions of 20 Gt of 
CO,, 12 Mt of NO, and 13 Mt of SO,, support- 
ing 1.7 million fossil job-years. Extending 
the retirement deadline to 5 years past 
typical life span or 2035, whichever is first, 
supports 26% more fossil job-years but also 
commits 30% more CO,, 32% more NO,, and 
29% more SO,. Using U.S. Bureau of Labor 
Statistics wage data and federal guidelines 
for emissions costs, these differences are 
worth an estimated additional $55 billion 
in direct wages (in high-paying industries 
that support, on average, about three indi- 
rect jobs/direct jobs, for people who could 
potentially seek alternative employment), 
at the cost of an estimated $250 billion in 
air pollution costs and $400 billion in CO, 
costs (excluding methane emissions) (2018 
dollars) (see SM). 


Decarbonizing the electricity system can- 
not occur through plant closures alone. 
Large amounts of infrastructure will need 
to be built, with associated issues related to 
community identity and the just transition 
(13). However, large amounts of infrastruc- 
ture will need to not be built. A commitment 
to ajust transition away from fossil fuels also 
demands that we minimize new liabilities in 
the form of new-build power plants that will 
require transition before the end of their use- 
ful lives (14). The federal Energy Information 
Administration’s 2020 “Reference Case” for 
electricity through 2050, which assumes 
static policy conditions, includes more than 
50 Gt of CO, of potential committed emis- 
sions from not-yet-built fossil fuel-fired 
electricity capacity. Proscribing construc- 
tion of new fossil fuel-fired generators is 
likely the simplest available action toward 
a just transition, particularly because pro- 
posed new utility fossil assets (which would 
be paid for by rate payers) are also dispro- 
portionately in states with higher poverty 
rates (see SM and figure S17). 

Transition policy is political (3), and suc- 
cess relies on political support. Transition 


Mapping plants whose life spans extend beyond 2035 

Shown are U.S. fossil fuel—fired generators with estimated fuel- and technology-specific life span extending 
past 2035, operable as of 2018, with capacity aggregated to plant level and labels based on largest fuel share 
burned at combined generators in 2018. Larger circle size indicates larger capacity. Direct employment (at plants, 
coal mines, and natural gas extraction facilities) associated with plants with life span extending past 2035 are 
shown by county. County locations for plant employment match plant locations; county locations for coal 
mining employment match the location of the known or assumed mine responsible for the largest share of 
coal supply for a given plant: and county locations for natural gas extraction employment are approximated 
based on spatial distribution of natural gas extraction employment in the U.S. overall. 
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policy that includes not only implementa- 
tion details such as retirement deadlines 
but also universal social and economic 
programs that address transition impacts 
both in and beyond fossil fuel host commu- 
nities—such as affordable housing, a $15 
minimum wage, and job guarantees—can 
advance normative ideals of a just transi- 
tion while also increasing political support 
(1, 3, 15). Emphasizing universal programs 
recognizes that a transition focused solely 
on fossil fuel workers and communities 
is not just and that support is also badly 
needed for the many people who lose jobs 
or more as a result of climate change, as 
well as those likely to be affected by zero- 
carbon industrialization. 


CONCLUSIONS 

Policy proposals to decarbonize the U.S. 
electricity sector require not only the ad- 
dition of zero-carbon electricity generation 
but also the subtraction of carbon-intensive 
generation. Requiring fossil generators to 
close by 2035 would result in limited, al- 
though sometimes locally impactful, asset 
stranding relative to typical life spans. Ac- 
tions such as Clean Energy Standards to set 
explicit retirement deadlines or New Source 
Performance Standards to restrict new fos- 
sil fuel-fired generation capacity, combined 
with universal social programs, can support 
a just transition. 
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Hunting wolves is legal in Slovakia unless it threatens populations, but available data are insufficient to determine its effects. 


Edited by Jennifer Sills 


Evidence-based hunting 
policy needed in Slovakia 


The Swiss people recently rejected a 
law that would have allowed protected 
animals to be hunted (1), but hunting 

of vulnerable species such as wolves 
still occurs in Slovakia and elsewhere in 
Europe. The European Union’s Habitats 
Directive allows deliberate killing of 
wolves in nine countries (2) unless hunt- 
ing would threaten the sustainability of 
the population, but population data are 
inadequate in some countries. Slovakia 
must implement evidence-based policies 
to protect wolf populations. 

In 2016, Slovakia made changes to 
increase wolf hunting regulation and 
improve population monitoring (3). 
However, the changes have not been 
implemented nationally. Recently, the 
Slovak Ministry of Agriculture and Rural 
Development approved a quota of 50 
wolves for the upcoming winter season 
(4). Such policies should be based on a 
scientific assessment of the viability of 
wolf populations (5). Instead, the Ministry 
justified the number by citing misleading 
arguments about sheep farming and food 
security (4). 

In contrast to the government’s claims, 
wolves kill less than 0.1% of Slovakia’s 
sheep and goats (3). The recent policy also 
fails to acknowledge that sheep breeding 
in Slovakia declined between 2009 and 
2019, when 28 to 158 wolves were killed 
per year, suggesting that hunting did 
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not mitigate the problem (6). The food 
security justification is also specious: 
Sheep and goat products are only a small 
part of Slovak diet and accounted for less 
than 0.4% of gross agricultural production 
in the past 10 years (7). Instead of relying 
on misleading justifications for hunting, 
Slovakia should find alternative methods 
to minimize the risk of damages from 
large carnivores. However, the country has 
so far opted not to use EU funds available 
for this purpose (8). 

Policies in Slovakia target wolves as the 
only source of problems in the agricultural 
sector and ignore the market-based causes 
of the sheep decline that have been shown 
elsewhere in Europe (9, 10). Although wolf 
numbers are trending positively in Europe 
(11), Slovak hunting affects wolf recovery 
in neighboring Czechia, where the wolf 
population is protected (12). Without reli- 
able evaluation of hunting impact, Slovakia 
cannot make informed policy decisions, 
despite the country’s nominal adherence to 
EU regulations. Slovakia’s failure to collect 
adequate data and base policy on science 
is a dangerous precedent that undermines 
biodiversity conservation efforts in Europe 
and worldwide. 
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Computational social 
science: On measurement 


In their Policy Forum “Computational 
social science: Obstacles and opportuni- 
ties” (28 August, p. 1060), D. M. J. Lazer 
et al. propose ethical data infrastructures 
for computational social science research. 
Concentrating on access to platform trace 
data, they dismiss third-party market data 
from such companies as Nielsen and com- 
Score because of “opaque” methods and 
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high cost. We believe both have virtues, but 
their proper use requires a keener appre- 
ciation of each measurement regime. 

All data result from measurement 
processes designed and executed to serve a 
given institutional context (/, 2). Platforms 
profit from shaping usage and they mea- 
sure toward that end. Using their trace data 
to understand human conduct remains 
problematic as long as platforms are 
themselves opaque about their methods for 
managing user behavior (3). Social Science 
One and Twitter’s COVID-19 application 
programming interface may be productive 
precedents of platform data provision, but 
computational social science should reckon 
with the effects of platform measurement. 

Unlike platforms, third-party measure- 
ment firms are not invested in how users 
behave. As with public-sector data (such as 
the U.S. Census), third-party measurement is 
periodically audited (4). Its procedures and 
consequences are constantly appraised by 
actors with competing interests (5). Serving 
industries, policy-makers, and academics, 
third-party market research has invested 
for decades in refining what Lazer et al. 
aspire to: “an administrative infrastructure... 
enforcing compliance with privacy and eth- 
ics rules,” which aligns “with critical research 
norms” including “transparency, reproduc- 
ibility, replication, and consent” (3, 6, 7). 

Third-party measurement firms such 
as Nielsen and comScore supply data to a 
broad subscriber base of advertising agen- 
cies and content publishers, which lowers 
data costs. Academic institutions worldwide 
may access numerous such third-party 
datasets via Wharton Research Data Services 
and Chicago Booth, brokers that partner 
with third-party firms for this purpose. 
Meanwhile, public data can be cost prohibi- 
tive (such as CDC’s National Death Index). 

What ensures data’s “public account- 
ability” is not a public-sector origin but 
how the measurement regime is insti- 
tutionally arranged (3). In addition to 
expanding data collaborations and data 
infrastructures, attention to the mea- 
surement regimes of “found data” and 
reflexive triangulation across data sources 
are indispensable to development of com- 
putational social science. 
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Chinese sturgeon needs 
urgent rescue 


China’s construction of large hydropower 
stations on the Yangtze and Jinsha rivers has 
had a devastating impact on the resources, 
spawning and reproduction behavior, and 
migration habits of the Chinese sturgeon 
(Acipenser sinensis), a “living fossil” (J) with 
immeasurable scientific, ecological, social, 
and economic value. In the 1970s, there were 
more than 10,000 Chinese sturgeon breed- 
ing populations in the Yangtze River, but the 
number dropped to 2176 in the 1980s when 
the Yangtze’s first dam and hydropower 
station opened, then to 363 in 2000, and to 
57 in 2010 (2). The International Union for 
Conservation of Nature (IUCN) now lists the 
Chinese sturgeon as Critically Endangered 
(3), and the Convention on International 
Trade in Endangered Species of Wild Fauna 
and Flora (CITES) lists the species under 
Appendix II (4). China must take action to 
save this vulnerable species. 

The Jinsha River, located in the upper 
reaches of the Yangtze River, is the tra- 
ditional spawning ground and natural 
breeding habitat of the Chinese sturgeon (5), 
as well as the largest hydropower base in 
China (6). The construction of the cascade 
dams on the Jinsha River hinders the 
Chinese sturgeon’s ability to swim upstream 
to spawn and migrate and damages their 
habitat (7). Moreover, a series of dams on 
the upper reaches of the Yangtze River and 
the Three Gorges Reservoir have affected 
Chinese sturgeon spawning activities by 
leading to warmer water temperatures (8). 
Since the Three Gorges, Xiangjiaba, and 
Xiluodu projects began operation in 2008, 
2012, and 2013, respectively, the effective 
reproduction of the Chinese sturgeon has 
mostly been lost, and the wild population is 
facing extinction. 

China has established three Chinese stur- 
geon nature reserves (9) and implemented a 
10-year fishing ban in the Yangtze River (10), 
which began in January 2020. These efforts 
are not enough. Although China artificially 
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breeds and releases Chinese sturgeon, this 
strategy is not effectively restoring popula- 
tion numbers (11). To save the wild Chinese 
sturgeon population, targeted measures 
must be taken to decrease the impact of 
the Yangtze River dams and restore natural 
reproduction. The government should 
immediately reassess the ecological impact 
of the Jinsha River hydropower project, 
especially with regard to Chinese sturgeon 
and other migratory fish habitats. A Chinese 
sturgeon channel should be built in the 
hydropower stations along the Jinsha River 
to ensure the smooth flow of migratory fish 
in the Yangtze River. China should also pri- 
oritize the protection of aquatic organisms 
in the Yangtze River, increase protection of 
the habitats of the Chinese sturgeon in the 
Yangtze River and its coastal areas, strictly 
control fishing, and increase the number of 
artificial releases. 
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and progenitor cell production 
and differentiation. Genetics or 
chemicals could be used to alter 
the sugar code and change the 
efficiency of blood production 
in embryos. Thus, genetically 
or pharmacologically alter- 
, Saya A | 5 ing N-glycan biosynthesis in 
` OV OF Sy ee 2 Ee SR Le T S EO endothelial cells could improve 
E Say E | b the efficiency of the production 
ATG JOURNALS o—s SL Pe of blood stem cells, which could 
R rae i! ane rae A pe Th, tas then be used to treat blood dis- 
. ea : aa BAE eases such as leukemia. —BAP 
Science, this issue p. 1186 
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SPECTROSCOPY 
Probing quantum 
materials 


Unraveling the functionalities of 
quantum materials such as spin- 
valley-electronic, topological, 
and many-body effects provides 
a route to exploiting these mate- 
rials for applications. Borsch et 
al. introduce a spectroscopic 
technique based on the concept 
of crystal-momentum combs. 
By extending the ideas of 
frequency combs of metrology 
and superresolution imaging, 
they demonstrate the ability to 
directly map out the properties 
of quantum electronic structures 
under ambient conditions. Using 
this technique combined with 
accurate many-body computa- 
tions, they were able to reveal 
tomographic images of two- 
dimensional quantum materials. 





TROPICAL FOREST —|SO 
: = Science, this issue p. 1204 
Fruit decline threatens forest elephants 
arge mammal herbivores in African tropical forests are major consumers of fruit, and many 
tree species rely on these consumers for dispersal of their seeds. Bush et a/. monitored CORONAVIRUS 
fruit production over three decades in a protected national park in Gabon, showing an 80% z £ . - 
decline across the 73 plant species monitored. At the same time, photographic records of SARS CoV 2 antibodies 
forest elephants over the past decade indicate a substantial decline in body condition in persist 
these major herbivores. These results suggest that the capacity of the ecosystem to support the As the number of daily COVID-19 
elephant population is decreasing, a worrying prospect in an environment that is still protected cases continues to mount world- 
from other threats such as hunting and deforestation. —AMS Science, this issue p. 1219 wide, the nature of the humoral 


immune response to severe 
acute respiratory syndrome 
coronavirus 2 (SARS-CoV-2) 
remains uncertain. Wajnberg et 
al. used a cohort of more than 


African elephants in tropical forests depend on fruit, a food source that has greatly declined in the past ~30 years. 





DEVELOPMENTAL BIOLOGY | that line the walls of develop- regulating this transition are 30,000 infected individuals 
Su ar code re ulates ing arteries. The transition from also poorly understood. Kasper with mild to moderate COVID- 

6 6 endothelial cells to blood stem et al. found that microRNA-223 19 symptoms to determine the 
blood stem cells cells is highly regulated and intrinsically restrains the vascu- robustness and longevity of 
During embryonic development, restricted to a small portion of lar hematopoietic transition by the anti-SARS-CoV-2 antibody 
blood stem cells are derived endothelial cells during a brief regulating N-glycan biosynthesis response. They found that 
from vascular endothelial cells period of time. The mechanisms to restrict hematopoietic stem neutralizing antibody titers 
SCIENCE sciencemag.org 4, DECEMBER 2020 • VOL 370 ISSUE 6521 1177 
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against the SARS-CoV-2 spike 
protein persisted for at least 5 
months after infection. Although 
continued monitoring of this 
cohort will be needed to confirm 
the longevity and potency of 
this response, these preliminary 
results suggest that the chance 
of reinfection may be lower than 
is currently feared. —STS 
Science, this issue p. 1227 


Paring down pyroptosis 
Shiga toxin is a phage-encoded 
exotoxin that interrupts protein 
translation and functions as a 
virulence factor for enterohem- 
orrhagic Escherichia coli (EHEC), 
a human pathogen causing hem- 
orrhagic colitis and acute renal 
failure. Havira et al. screened a 
panel of EHEC mutants lacking 
various virulence factors to 
find those that interfered with 
inflammasome-mediated cell 
death. EHEC strains lacking 
Shiga toxin were more potent 
inducers of macrophage pyrop- 
tosis and interleukin-1B secretion 
than wild-type EHEC. Shiga toxin 
from wild-type EHEC interfered 
with pyroptosis by blocking the 
ability of the activated form of 
caspase-ll, a cytoplasmic lipo- 
polysaccharide sensor, to cleave 
gasdermin D and initiate the 
formation of gasdermin pores 
in the plasma membrane. This 
unanticipated activity of Shiga 
toxin provides EHEC with an 
additional means of evading the 
innate immune system. —|IW 

Sci. Immunol. 5, eabcO217 (2020). 





Colored transmission electron micro- 
scope image of a pathogenic 

strain of Escherichia coli that produces 
Shiga toxin 


1178 


Deep, hot, and more alive 
than we thought 


Marine sediments represent 
a massive microbial ecosys- 
tem, but we still do not fully 
understand what factors shape 
and limit life underneath the 
seafloor. Analyzing samples 
from a subduction zone off 
the coast of Japan, Heuer et 
al. found that microbial life, 
in particular bacterial vegeta- 
tive cells, decreases as depth 
and temperature increases 
down to ~600 meters below 
the seafloor, corresponding 
to temperatures of ~70°C. 
Below this limit, endospores 
are common—a remnant, and a 
potential reservoir, of bacte- 
rial life. Deeper still is a sterile 
zone, and below 1000 meters is 
a scalding realm populated by 
vegetative cells. At such great 
depths, high concentrations 
of acetate and sulfate coex- 
ist, and there are also signs of 
hyperthermophilic methano- 
genesis. These data provide 
a fascinating window into an 
extreme and inhospitable 
environment that nonetheless 
supports microbial life. —MAF 
Science, this issue p. 1230 


A regulatory switch 
in obesity 


The liver not only produces 
glucose but also consumes 
a large amount of glucose, 
making this organ critical for 
glucose homeostasis. Kokaji 
et al. performed multiomics 
analyses on the liver and blood 
of normal mice and in a genetic 
mouse model for obesity 
(ob/ob). Whereas normal 
hepatic metabolic responses to 
glucose were rapid and relied 
on regulation by metabolites, 
those in ob/ob mice were slow 
and depended on changes in 
gene expression. Thus, obesity 
in this model not only slows 
hepatic metabolic responses to 
glucose but also makes these 
responses more energy con- 
Suming and less precise. —WW 
Sci. Signal. 13,eaaz1236 (2020). 
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Neuroprotective 
neutrophils? 


The infiltration of activated 
immune cells into the central 
nervous system can contribute to 
the pathophysiology of condi- 
tions such as multiple sclerosis 
and traumatic brain injury. The 
roles that immune cells play in 
the resolution of inflammation 
and the regeneration of damaged 
tissue are less well understood. 
Sas et al. report the existence of 
a distinct population of Ly6G’® 
cells resembling immature 
neutrophils that were elicited in 
a mouse model of optic nerve 
injury. Intraocular injection of a 
fungal cell wall extract induced 
the migration of these cells into 
the eye, where they promoted 
retinal ganglion cell survival 

and axonal regeneration, in part 
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through a range of secreted nerve 
growth factors. These Ly6G" cells 
also spurred axonal regenera- 
tion in a mouse model of spinal 
cord injury. Similar regenerative 
capacity by an immature human 
neutrophil cell line suggests that 
immature granulocytes may be 
an attractive potential target for 
future neuroregenerative thera- 
pies. —STS 

Nat. Immunol. 21,1496 (2020). 


Sensing an allergen 
Dendritic cells (DCs) can initiate 
allergic immune responses, but 
how allergens influence DCs is 
unclear. Using a mouse model 

in which the allergen papain was 
injected into the skin, Perner et 
al. report that TRPV1* sensory 
neurons are required for an 
immune response to allergens. 
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Allergen-driven activation of 
TRPV-expressing neurons caused 
the release of the neuropeptide 
substance P, which resulted in 
the migration of CD301b* DCs to 
the draining lymph nodes. The 
induction of adaptive T helper 2 
immunity, including itching and 
pain responses, was observed. 
—PNK 

Immunity 53, 1063 (2020). 


Escherichia coli as the 


king’s dye maker 

Clothing colored with Tyrian 
purple, a dye extracted from mol- 
lusks in a laborious and expensive 
process, has been used over the 
ages as amarker of wealth and 
royal status. The main pigment, 
6,6-dibromoindigo, is simple but 
remains a challenge to synthesize 
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(2020). 


in pure form. Lee et a/. engineered 
two strains of Escherichia coll 
with enzymes that first convert 
tryptophan to 6-bromotryp- 
tophan and then oxidize and 
dimerize two molecules to form 
6,6-dibromoindigo. Switching out 
different tryptophan halogenase 
enzymes, the authors created 
a spectrum of halogenated dye 
variants. The resulting dyes can 
be used directly to stain fabrics 
without purification. —MAF 
Nat. Chem. Biol. 10.1038/ 
S41589-020-00684-4 (2020). 


Seeding the problem 

A key pathological event in 
Alzheimer’s disease (AD) is the 
aggregation and deposition of 
amyloid-B (AB). However, the 
formation of AB deposits is a 
relatively late readout of the AB 





Slow pokes avoid 
gridlock 


acterial colonies contain 
millions of cells yet show 
coordinated movement toward 
more favorable conditions at a 
colony’s margins. Species such 
as Pseudomonas aeruginosa contain 
external appendages called pili that 
grab the substrate and pull cells along. 
Meacock et a/. found that mutants 
that had more pili and that could move 
faster were disadvantaged under 
crowded conditions. Theory developed 
from liquid crystal research shows that, 
when densely packed, particles tend 
to align. In a densely packed colony, 
Slow bacterial cells overtake fast cells. 
This happens because when cells with 
differing orientations collide, the fast 
cells tend to spin vertically and become 
trapped in “rosettes.” This means that 
the trapped cells are left behind in the 
nutrient-poor and oxygen-depleted 
center of the colony. —CA 

Nat. Phys. 10.1038/s41567-020-01070-6 


A bacterial colony resembling liquid 
crystal, in which fast-moving Pseudomonas 
aeruginosa cells are forced to align into 
“rosettes” and are overtaken by slow cells 


aggregation process. Treatments 
that affect such deposits may 
fail to interrupt the critical early 
seeding stage of AB deposi- 
tion. It remains unclear when 
pathogenic AB seeds begin to 
form, propagate, and spread 
through the brain. Furthermore, 
the precise nature of the initial AB 
seeds remains unknown. Working 
in mouse models of AD, Uhlmann 
et al. tested a variety of known 
antibodies for their ability to 
neutralize AB seeds before amy- 
loid deposition was detectable. 
Early administration of one such 
antibody, aducanumab, signifi- 
cantly reduced AB deposition and 
the resulting pathology. This work 
points to targeting and removal 
of early AB seeds as promising 
future therapies for patients 
developing AD. —SMH 
Nat. Neurosci. 10.1038/ 
S$41593-020-00737-w (2020). 
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Dynamic control of 


boron esters 


Visible light at different 
frequencies can photoswitch 
azobenzenes between their 
F and Z isomers. Accardo et 
al. used the isomerization of 
ortho-substituted azoben- 
zene boronic acids to tune the 
affinity for diols to bind and 
form esters. Methoxy groups 
ortho to the azo group on the 
ring opposite the boronic acid 
group destabilized diol binding 
to the E isomer. The E isomer 
also formed favorable hydrogen 
bonds to water that inhibited 
displacement by the diol. Both 
effects helped to boost the rela- 
tive binding affinity for ethylene 
glycol binding to the Z versus E 
isomer to more than 20. They 
exploited this effect to create 
photoswitchable gels formed 
between four-arm amine-termi- 
nated poly(ethylene glycol)s 
that had end groups of either 
azobenzene boronic acids or 
extended diols. —PDS 

J.Am. Chem. Soc. 10.1021/ 

jacs 0c08551(2020). 


Protein IR spectra from 
machine learning 


Infrared (IR) absorption spec- 
troscopy is among the most 
powerful tools used to under- 
stand the atomic-level structure 
and the function of proteins. 
Theoretical interpretation of IR 
spectra generally depends on 
the ability to perform first-prin- 
ciples simulations, which may 
be very costly. Machine learning 
(ML) has recently emerged as 
an alternative way to overcome 
the need for such calculations. 
Ye et al. propose a ML method 
based on a few key structural 
descriptors and ab initio data to 
predict amide | region IR spectra, 
which is rich in fingerprints of 
protein structure and dynamics. 
Their trained model was able 
to provide fast and accurate 
characterization of IR spectra for 
different proteins under varying 
conditions. —YS 

J. Am. Chem. Soc. 142, 19071 (2020). 
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MICROBIOTA 
Getting to the guts of 
local evolution 


The microbiota of mammals 
is a product of coevolution. 
However, humans exhibit a range 
of adaptive peculiarities that can 
be quite geographically specific. 
The human microbiota also 
displays a variety of commu- 
nity compositions and a range 
of overlapping and redundant 
metabolic characteristics that 
can alter host physiology. For 
example, lactase persistence 
is a genetic characteristic of 
European populations, but in 
populations lacking the lactase 
gene, milk sugar digestion is 
endowed by the microbiota 
instead. Suzuki and Ley review 
the evidence for the role that the 
microbiota plays in local adapta- 
tion to new and changing human 
circumstances. —CA 

Science, this issue p. 1180 


CORONAVIRUS 
How lethal coronaviruses 
engage hosts 


Severe acute respiratory 
syndrome coronavirus 2 
(SARS-CoV-2) is closely related 
to the deadly coronaviruses 
SARS-CoV-1 and Middle East 
respiratory syndrome coronavi- 
rus (MERS-CoV). Considerable 
efforts are focused on develop- 
ing treatments, and therapies 
that work across coronaviruses 
would be particularly valuable. 
Shedding light on the host 
factors hijacked by the viruses, 
Gordon et al. mapped the 
interactions between viral and 
human proteins for SARS-CoV-2, 
SARS-CoV-1, and MERS-CoV: 
analyzed the localization of 

viral proteins in human cells; 
and used genetic screening to 
identify host factors that either 
enhance or inhibit viral infection. 
For a subset of the interactions 
essential for the virus life cycle, 
the authors determined the 
cryo—electron microscopy struc- 
tures and mined patient data to 


1179-B 


understand how targeting host 
factors may be relevant to clini- 
cal outcomes. —VV 

Science, this issue p. 1181 


INNATE IMMUNITY 
A ligand is located, 


at long last! 


Members of the Nod-like 
receptor (NLR) family act as 
intracellular sensors of infec- 
tion. Once they recognize 
pathogen-associated molecular 
patterns, they assemble into 
signaling complexes called 
inflammasomes, which induce 
proinflammatory cytokines and 
pyroptotic cell death. Although 
rodent NLR family pyrin 
domain containing 1 (NLRP1) 
can recognize bacterial toxins 
and protozoan pathogens, the 
ligands for human NLRP1 have 
remained elusive. Robinson et al. 
found that human NLRPI senses 
and is activated by enterovi- 
ruses. During human rhinovirus 
(HRV) infection, the HRV 3C 
protease cleaves an autoinhibi- 
tory N-terminal fragment from 
NLRPI, which is subsequently 
degraded. The NLRPI1 C-terminal 
fragment that is released then 
initiates inflammasome forma- 
tion. This work offers insights 
into immune sensing of respira- 
tory viral infections and provides 
an example of the N-terminal 
glycine degron pathway in 
human innate immunity. —STS 
Science, this issue p. 1182 


EVOLUTION 
Selection enhances 
mutation toleration 


Mutations generate variability 
that Is either neutral or subject 
to natural selection. Robustness 
is a measure of the ability to 
withstand deleterious mutational 
effects. Zheng et al. exposed 
Escherichia coli populations 
expressing a yellow fluorescent 
protein to strong, weak, or no 
selection for yellow fluorescence 
for four generations. They then 
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selected these populations to a 
related function, green fluores- 
cence, for four more generations. 
The strong selection first for 
yellow and then green fluores- 
cence resulted in the most green 
fluorescence and the accumula- 
tion of the most mutations. This 
outcome likely was due to the 
increased foldability of the pro- 
tein. Selection thus provides a 
threshold for mutation accumu- 
lation, but robustness maintains 
a buffer necessary for protein 
evolution. —LMZ 

Science, this issue p. 1183 


BIOMECHANICS 
Protection in the wings 


Beetles have hardened fore- 
wings for the protection of their 
bodies and hindwings, and their 
use during crawling or burrowing 
is well understood. However, the 
behavior of the larger hindwings 
during collisions in flight has 
not been clear because they 
do not readily flex. Phan and 
Park present a detailed study 
of the folding and unfolding 
mechanisms of the hindwings in 
free-flying rhinoceros beetles in 
which the wings were impacted 
during flight to simulate a clut- 
tered environment (see the 
Perspective by Sun). They found 
that origami-like folds in the 
wing could rapidly collapse on 
impact and then spring back, 
thus acting as shock absorb- 
ers and stabilizers. The authors 
replicated this behavior ina 
flapping-wing robot, enabling 
it to fly safely after collisions. 
—MSL 

Science, this issue p. 1214: 

see also p.1165 


PLANT SCIENCE 


Competitive roots 

Much of the world’s plant 
biomass exists out of sight 
underground tn the form of 
roots. Cabal et al. developed a 
theoretical model and tested it 
empirically to explain the rules 
that govern root growth (see 
the Perspective by Semchenko). 


Published by AAAS 


Plants adjust how and where 
their roots grow according to 
how close neighboring—and 
competing—plants might be. 
The model extracts some of the 
rules about how root balls differ 
when grown close to neighbor- 
ing plants compared with being 
grown in the absence of compe- 
tition. —PJH 

Science, this issue p. 1197; 

see also p. 1167 


QUANTUM GASES 
Watching sound die out 


A gas of strongly interacting 
fermionic atoms can serve as a 
model for systems with densities 
and energies spanning many 
orders of magnitude. This uni- 
versality of physics comes about 
thanks to a property known 
as scale invariance. Patel et al. 
exploited this concept to draw 
universal conclusions about the 
attenuation of sound in such 
systems by studying a homoge- 
neous gas of lithium-6 atoms at 
very low temperatures (see the 
Perspective by Schaefer). They 
found that below the superfluid 
transition, the sound diffusiv- 
ity behaved not unlike what 
has been observed in helium-4, 
a fluid of strongly interacting 
bosons. —JS 

Science, this issue p. 1222; 

see also p. 1162 


NEUROSCIENCE 
Restoring vision by 
stimulating the brain 


Electrical stimulation of the 
visual cortex has long been 
proposed as an approach to 
restoring vision in blind people. 
Previous studies positioned 
electrodes on the surface of 
the brain and thus required 
delivery of relatively high cur- 
rents. However, this approach 
limits the number of electrodes 
that can be safely stimulated 
simultaneously, and such sur- 
face electrodes activate several 
millimeters of cortex, which 
results in a low spatial resolution. 


sciencemag.org SCIENCE 
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Chen et al. demonstrated that 
the simultaneous stimulation of 
multiple intracortical electrodes 
in the monkey primary visual 
cortex gives rise to the percep- 
tion of shape and successive 
stimulation to the perception of 
motion (see the Perspective by 
Beauchamp and Yoshor). This 
major improvement provides 
proof of concept for the use of 
electrical microstimulation to 
create a form of artificial vision 
in the blind. —PRS 

Science, this issue p. 1191: 

see also p. 1165 


PHYSICS 


Probing the dark state 
Excitons, electron-hole pairs 
held together by Coulomb 
attraction, can be generated in 
semiconductors under excitation 
and greatly influence the mate- 
rial’s optoelectronic properties. 
Although bright excitons are 
optically active, their dark-state 
cousins have been more difficult 
to detect. They do, however, 
affect the optoelectronic proper- 
ties through their interaction 
with light and bright excitons. 
Madéo et al. developed a pump- 
probe photoemission technique 
that is used reveal the spatial, 
temporal, and spectral dynamics 
of excitons (see the Perspective 
by Na and Ye). Demonstrated 
in two-dimensional monolayer 
films of tungsten diselenide, 
the technique could also be 
applicable to other semiconduc- 
tor systems hosting excitonic 
excitations. —ISO 

Science, this issue p. 1199; 

see also p. 1166 


PLANT SCIENCE 
Tetrameric immune 
receptors 


Nucleotide-binding/leucine-rich 
repeat (NLR) immune recep- 
tors detect pathogen effectors 
and trigger a plant’s immune 
response. Two groups have now 
defined the structures of two 
NLRs that carry Toll-like inter- 
leukin-1 receptor (TIR) domains 
(TIR-NLRs) (see the Perspective 
by Tian and Li). Ma et al. stud- 
ied the Arabidopsis thaliana 


SCIENCE sciencemag.org 


TIR-NLR RPPI1 (recognition of 
Peronospora parasitica 1) and its 
response to effectors from an 
oomycete pathogen. Martin et al. 
studied the Nicotiana benthami- 
ana TIR-NLR ROQI] (recognition 
of XopQ 1) and its response to 
the Xanthomonas effector. Both 
groups found that these TIR- 
NLRs formed tetramers that, 
when activated by binding to 
the pathogen effector, exposed 
the active site of a nicotinamide 
adenine dinucleoside (NAD) 
hydrolase. Thus, recognition of 
the pathogen effector initiates 
NAD hydrolysis and begins the 
immune response. —PJH 

Science, this issue p. 1184, p. 1185; 

see also p. 1163 


GENE THERAPY 
Gene therapy for CLN2 
disease 


Late infantile Batten disease 
(CLN2 disease), a pediatric 
progressive brain disorder, is 
currently treated by infusion of 
human recombinant tripepti- 

dyl peptidase 1 (TPP1) into the 
cerebrospinal fluid every other 
week, which slows but does not 
halt progression of the disease. 
Sondhi et al. sought an alter- 
native treatment using gene 
therapy. They injected an adeno- 
associated virus vector enabling 
expression of the normal human 
coding sequence for the CLN2 
gene directly into the brain 
parenchyma of children with the 
disease. Progression of CLN2 was 
slowed in treated children but not 
to the same degree as in those 
treated with recombinant TPP1. 
Further improvements in gene 
therapy are needed before the 
progression of CLN2 disease can 
be halted. —MN 

Sci. Transl. Med. 12, eabb5413 (2020). 


HUMAN GENETICS 
Histone H3.3 mutations 
alter development 


Dominant de novo germline 
mutations in the human H3F3A 
or H3F3B genes that encode 
the H3.3 histone variant cause 
developmental delays, neuro- 
degeneration, and structural 





abnormalities. Bryant et al. 
describe 37 different missense 
mutations in 46 patients with 
overlapping phenotypes of 
variable severity. H3.3 marks 
active genes and influences gene 
expression and DNA damage 
repair. The mutations likely 
affect only 25% of the total H3.3, 
indicating exquisite sensitivity of 
development to H3.3 function. 
Prior studies report somatic 
mutations affecting H3.3 tails in 
cancer, but the germline muta- 
tions are located throughout 
the protein and differentially 
alter interaction with DNA, other 
histones, or other proteins. RNA 
sequencing with patient cell lines 
suggest that the common effects 
of the mutations stem from 
increased cell proliferation. —DD 
Sci. Adv. 10.1126/sciadv.abc9207 
(2020). 


CORONAVIRUS 
A decoy to neutralize 
SARS-CoV-2 


Many efforts to develop 
therapies against severe acute 
respiratory syndrome coronavi- 
rus 2 (SARS-CoV-2) are focused 
on the interaction between the 
spike protein, which decorates 
the surface of the virus, and its 
host receptor, human angio- 
tensin-converting enzyme 2 
(hACE2). Linsky et al. describe 
a de novo design strategy that 
allowed them to engineer decoy 
proteins that bind to the spike 
protein by replicating the hACE2 
interface. The best decoy, CTC- 
445, bound with low nanomolar 
affinity, and selection of viral 
mutants that decrease binding is 
unlikely because this would also 
affect binding to hACE2. A biva- 
lent version of CTC-445 bound 
even more tightly, neutralized 
SARS-CoV-2 infection of cells, 
and protected hamsters from 
a SARS-CoV-2 challenge. The 
stable decoy has the potential 
for respiratory therapeutic 
delivery. —VV 

Science, this issue p. 1208 
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MICROBIOTA 


The role of the microbiota in human 


genetic adaptation 


Taichi A. Suzuki and Ruth E. Ley* 


BACKGROUND: When human populations ex- 
panded across the globe, they adapted genet- 
ically to local environments in response to 
novel selection pressures. Drivers of selection 
include exposure to new diets, climates, or 
pathogens. Humans harbor microbiotas that 
also respond to changes in local conditions 
and changes in their hosts. As a result, mi- 
crobiotas may alter the adaptive landscape 
of the host through modification of the en- 
vironment. Examples include changes to a 
food’s nutritional value, the host’s tolerance 
to cold or low amounts of oxygen, or suscept- 
ibility to invading pathogens. By buffering or 
altering drivers of selection, the microbiota 
may change host phenotypes without coevo- 
lution between host and microbiota. Func- 
tions of the microbiota that are beneficial to 
the host may arise randomly or be acquired 
from the environment. These beneficial func- 
tions can be selected without the host exert- 
ing genetic control over them. Hosts may 
evolve the means to maintain beneficial mi- 


Local adaptation by humans and 
their microbiotas. When human 
populations adapt genetically 

to new environments, their 
microbiotas may also participate 
in the process. Microbes can 
evolve faster than their host, which 
allows them to respond quickly to 
environmental change. They also 
filter the host’s environment, 
thereby altering selective 
pressures on the host. Illustrated 
here are examples of interactions 
between adaptive host alleles and 
adaptive microbiota functions where 
the microbiota likely modified the 
adaptive landscape in response 

to changes in diet (e.g., changes 

in levels of starch and milk 
consumption), exposure to local 
pathogens (e.g., malaria parasites 
and Plasmodium spp.), and 
changes in local climate (e.g., cold 
Stress and hypoxia). In this paper, 
we discuss the resulting relationships | 
between host-adaptive alleles | 
and microbiota functions. 


Suzuki et al., Science 370, 1180 (2020) 


crobes or to pass them to offspring, which 
will affect the heritability and transmission 
modes of these microbes. Examples in humans 
include the digestion of lactose via lactase ac- 
tivity (encoded by the LCT gene region) in 
adults and the digestion of starch by salivary 
amylase (encoded by the AMYI gene)—both 
are adaptations resulting from shifts in diet. 
The allelic variation of these genes also pre- 
dicts compositional and functional variation 
of the gut microbiota. Such feedback between 
host alleles and microbiota function has the 
potential to influence variation in the same 
adaptive trait in the host. How the microbiota 
modifies host genetic adaptation remains to 
be fully explored. 


ADVANCES: In this paper, we review examples of 
human adaptations to new environments that 
indicate an interplay between host genes and 
the microbiota, and we examine in detail the LCT- 
Bifidobacterium and the AMY1I-Ruminococcus 
interactions. In these examples, the adaptive 
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host allele and adaptive microbial functions 
are linked. We propose host mechanisms that 
can replace or recruit beneficial microbiota 
functions during local adaptation. Finally, 
we search for additional examples where mi- 
crobiotas are implicated in human genetic 
adaptations, in which the genetic basis of 
adaptation is well described. These range from 
dietary adaptations, where host and microbial 
enzymes can metabolize the same dietary 
components (e.g., fatty acid and alcohol meta- 
bolism), through climate-related adaptations, 
where host and microbes can induce the same 
physiological pathway (e.g., cold-induced ther- 
mogenesis, skin pigmentation, and blood pres- 
sure regulation), to adaptations where hosts 
and microbes defend against the same local 
pathogens (e.g., resistance to malaria, cholera, 
and others). These examples suggest that mi- 
crobiota has the potential to affect host evo- 
lution by modifying the adaptive landscape 
without requiring coevolution. 


OUTLOOK: Well-studied examples of local adap- 
tation across diverse host species can be revisited 
to elucidate previously unappreciated roles for 
the microbiota in host-adaptive evolution. In 
the context of human adaptation, knowledge 
of microbial functions and host gene-microbe 
associations is heavily biased toward observa- 
tions made in Western populations, as these 
have been the most intensively studied to date. 
Testing many of the interactions proposed in 
this Review between host genes under selec- 
tion and the microbiota will require a wider 
geographic scope of populations in their local 
contexts. Because genes under strong selection 
in humans are often involved in metabolic and 
other disorders and can vary between popu- 
lations, future investigations of host gene- 
microbe interactions that relate to human 
adaptation may contribute to a deeper under- 
standing of microbiota-related diseases in 
specific populations. Investigating host gene- 
microbe interactions in a wider variety of 
human populations will also help researchers 
go beyond collections of anecdotes to form the 
basis of a theory that takes microbial contri- 
butions to host adaptation into account in a 
formal framework. A better understanding of 
reciprocal interactions between the host ge- 
nome and microbiota in the context of adap- 
tive evolution will add another dimension to 
our understanding of human evolution as we 
moved with our microbes through time and 
space. 
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MICROBIOTA 


The role of the microbiota in human 


genetic adaptation 


Taichi A. Suzuki and Ruth E. Ley* 


As human populations spread across the world, they adapted genetically to local conditions. So too did 
the resident microorganism communities that everyone carries with them. However, the collective 
influence of the diverse and dynamic community of resident microbes on host evolution is poorly 
understood. The taxonomic composition of the microbiota varies among individuals and displays a range 
of sometimes redundant functions that modify the physicochemical environment of the host and 

may alter selection pressures. Here we review known human traits and genes for which the microbiota 
may have contributed or responded to changes in host diet, climate, or pathogen exposure. Integrating 
host-microbiota interactions in human adaptation could offer new approaches to improve our 


understanding of human health and evolution. 


ow organisms adapt to new environ- 
ments is a central question in evolution- 
ary biology that is gaining importance in 
the face of rapid environmental change. 
A vast diversity of eukaryotic species 
harbor microbial communities, or microbio- 
tas, which influence their health and fitness 
(1). Host-associated microbiotas can provide 
novel ecological capabilities, evolve quickly, 
and exchange microbes or genes with other 
microbiotas, their hosts, and the environment 
(2-5). The list of microbes known to be bene- 
ficial to animals and plants is expanding, as is 
knowledge of their associated mechanisms. How 
host populations adapt to changing environ- 
ments through natural selection involving 
genetic change across generations is well ex- 
plored (2-5), but the role of microbiotas in 
mediating adaptation, particularly at the popu- 
lation level, is less well understood (Box 1). 
Drivers of local adaptation include changes 
in host diet, local climate, and novel patho- 
gen encounters. During the process of local 
adaptation, natural selection can act on the 
host, the microbiota, or their interactions 
(6). In host-microbiota partnerships where 
microbes are strictly transmitted from parent 
to offspring, selection can act on the host and 
its symbionts as a unit to favor microbial 
functions that enhance host fitness. This is 
known as the hologenome theory of evolu- 
tion (2, 3), in which reciprocal selection be- 
tween partners (i.e., host-microbial coevolution) 
is invoked. Coevolution is exemplified by the 
formation of organelles in eukaryotes and of 
endosymbionts in insects (7). The two part- 
ners become irreversibly dependent on each 
other and enhance each other’s fitness. The 
microbe, as an endosymbiont, must be trans- 
Department of Microbiome Science, Max Planck Institute for 
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mitted vertically because of genetic inter- 
dependence with the host. Several models 
have been developed that present the neces- 
sary conditions for host-microbial coevolution 
(2-5, 8, 9). 

For host-microbiota systems that are di- 
verse, complex, and dynamic—such as the 
gut microbiotas of animals or the leaf micro- 
biotas of plants—many members of the com- 
munity do not fit the criteria for coevolution 
(5, 6, 8). Members of complex microbiotas may 
have mixed transmission routes, and selection 
pressures on the host and microbes may be 
decoupled. Under these conditions, environ- 
mental acquisition and ecological filtering are 


the dominant processes that shape microbial 
community composition (JO). As a result, 
members of microbiotas are rarely selected 
to improve host fitness (11, 12). Rather, by 
maximizing their own fitness (Box 1), microbes 
can impose a fitness cost to the host—for in- 
stance, when they act as opportunistic patho- 
gens or pathobionts (71, 12). Indeed, many human 
gut microbes, such as Bacteroides species, are 
considered both beneficial and opportunistic 
pathogens, owing to their occasional participa- 
tion in infection by mucosal invasion, causing 
pathology (//). Just as most new mutations 
are deleterious to a genome, most new ran- 
dom changes to a microbiota will likely not 
improve host fitness, and those that are ben- 
eficial to the host will be rare. 

Rare and random acquisitions of a benefi- 
cial microbe or function are required for the 
microbiota to make a positive impact on host 
fitness and influence adaptation. There are 
two main ways the beneficial functions of a 
microbiota may be favored by natural selec- 
tion without coevolution. First, selection on 
microbial community processes alone can 
favor a beneficial microbiota (6). For example, 
Kikuchi et al. showed that stinkbugs can ac- 
quire soil microbes that have been preselected 
for the detoxification of insecticide in soil, 
thereby rendering the host resistant to insec- 
ticide (13). Kohl et al. showed that toxin-naive 
woodrats can acquire gut microbes that detox- 
ify plant toxins in their diet (74). In both cases, 
the selective agent driving the detoxification 
ability of the microbes is likely the toxic com- 
pounds themselves, rather than selection on 
host fitness. A notable example in humans was 


Box 1. Terminology 


Adaptation: We use the term “adaptation” to mean evolution through natural selection that involves 
genetic changes in a population across generations, as opposed to acclimation that occurs within an 
individual's lifetime. Local adaptation is when a subpopulation of a species genetically adapts to its 


local condition. 


Microbiota: [he assemblages of bacterial, archaeal, and eukaryotic microorganisms and their viruses 


that make up a microbial community. 


Narrow-sense heritability (h): The ratio of additive genetic variance to phenotypic variance within a 
population (96, 97). Note that h* of microbiota measurements can be confounded by vertical 
transmission of microbes in some studies including chip heritability (the proportion of phenotypic 
variation that can be explained by the distribution of genetic markers or single-nucleotide poly- 
morphisms) (98). However, h* estimates in twin studies are independent of transmission modes because 
monozygotic and dizygotic twins are not expected to have different amounts of vertically transmitted 
microbes (98). Relative abundances of both vertically and horizontally transmitted microbes can have 
significant h? estimates if there are significant additive genetic variants in the host genomes that affect 
the relative abundances of the microbes. Heritability in Box 2 is h°. 

Vertical transmission: A process of passing on a microbe(s) from parents to their offspring (99). 
Transfer of microbes from the mother’s birth canal or skin to the offspring is an example of vertical 


transmission (100, 101). 


Horizontal transmission: A process of passing a microbe(s) between hosts that are not in a parent- 
offspring relationship (i.e., interhost transmission) (99). Transfer of microbes among social partners is an 
example of horizontal transmission (102, 103). The term may also include environmental transmissions 
(e.g., environmental acquisition of microbes from food and soil) (104). 
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uncovered by Hehemann et al., who showed 
that in Japanese populations the gut bacte- 
rium Bacteroides plebeius has acquired a 
porphyranase gene from the seaweed-dwelling 
bacterium Zobellia galactanivorans, and people 
harboring this microbe have thereby acquired 
the ability to break down algal carbohydrates 
(15). More recent evidence shows that the 
enzymatic capacity to break down glycans from 
seaweed is generally enriched in the gut micro- 
biomes of several Asian populations compared 
with those of other groups and is prevalent in 
different species of Bacteroides (16). Although 
the fitness consequences of the seaweed- 
derived energy to the host are unclear, these 
examples show that selection acting on micro- 
biota-encoded processes alone may benefit 
the host without requiring the host to evolve 
adaptive mutations. 

Selection based on differences in host traits 
has been shown experimentally to also drive 
differences in the microbiota that promote 
those host traits. For instance, Panke-Buisse et al. 
performed an experimental evolution study in 
which transfer of soil microbiota from batch to 
batch of Arabidopsis thaliana seeds, while 
keeping the plant genetic stock constant, drove 
divergence in flowering time (7. Similarly, 
others have selected for soil microbiotas that 
alter the biomass of A. thaliana (18) and Medicago 
sativa (19). In these examples, selection for soil 
microbiotas alone was sufficient to drive changes 
in host phenotype, without coevolution. Beyond 
affecting host phenotype, the microbiota can 
shape the patterns of host genetic adaptation. 
Rudman et al. showed that selection based on 
differences in the Drosophila melanogaster 
microbiota caused divergence in host popula- 
tions. These authors identified divergent loci 
that were previously associated with adapta- 
tion in natural populations (20). Together, these 
empirical studies establish the potential for the 
microbiota to drive host adaptation by altering 
the host’s environment. 

A few evolutionary models predict how the 
microbiota affects the rate of host-adaptive 
evolution by extending host phenotypic plastic- 
ity through the microbiota (4) and incorporating 
the microbiota into models of quantitative 
genetics (27). Formal models that assess the 
evolutionary dynamics between host-adaptive 
alleles and the microbiota are lacking, but first 
we need evidence for such interactions. 

Here we propose examples that suggest an 
interplay between host genes and the micro- 
biota during the process of adaptation. We 
highlight two examples of interactions across 
multiple generations between human alleles 
and associated microbes and discuss how the 
host can replace or recruit beneficial microbes 
without requiring specifically coevolved orga- 
nisms. We also discuss other adaptive traits in 
humans, where microbes may have interacted 
with host alleles during local adaptation. 
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Host gene-microbe associations in 

human adaptation 

Adaptation of modern human populations to 
local environments and the genetic bases of 
these adaptations are well characterized (22). 
Differences in selection pressures, including 
diet, climate, and pathogen exposure, have re- 
sulted in differences among human popula- 
tions in traits related to digestion (23, 24), 
physiology (25, 26), and immunity (27, 28). 
These differences result from local genetic 
adaptations occurring across multiple gener- 
ations and are distinguishable from adapta- 
tions by the immune system or physiological 
acclimations within the lifetime of a host (Box 1). 
In many cases, the primary genetic variants 
responsible for these adaptive traits have been 
identified: These include mutation and trun- 
cation of lactase (encoded by the gene region 
LCT) required in lactose digestion (29), the 
role of hypoxia-inducible factor 2-alpha (encoded 
by EPAS]) in high-altitude adaptation (26), and 
beta-globin gene haplotypes or sickle cell 
traits (encoded by HBB) in engendering malaria 
resistance (27), all of which represent classic 
examples of local adaptation (22). Moreover, 
compositional and functional variation of the 
human microbiota has been described exten- 
sively (30). Amato et al. recently reviewed 
beneficial functions of the microbiota that 
have likely affected local adaptation in humans, 
such as the microbiota’s enhancement of nu- 
trition and protection from infectious disease 
(31). However, specific interactions between 
host-adaptive alleles and adaptive microbiota 
functions during local adaptation in humans 
remain to be explored. 


Host genes replace beneficial 
microbiota functions 


Mammals are characterized by the behavior 
of feeding their offspring maternal milk—the 
main energy supply to infants coming from milk 
sugar lactose. In the upper gastrointestinal tract 
(GIT), the host enzyme lactase cleaves the milk 
sugar lactose into glucose and galactose, which 
are then absorbed. After weaning, when milk 
is no longer part of the diet, the production of 
lactase ceases [lactase nonpersistence (LNP) is 
the ancestral condition]. Animal domestica- 
tion on different continents (2500 to 10,000 
years ago) and the consumption of nonhuman 
milk repeatedly led to strong selective pres- 
sure for lactase production to persist into 
adulthood [lactase persistence (LP)] (29). In 
the present, allelic variation at the LCT locus 
predicts the LP phenotype in multiple human 
populations, and the independent develop- 
ment of LP has resulted in a variety of LP 
alleles (29). 

Many microbes that are common constituents 
of the gut microbiota can use B-galactosidases 
to cleave lactose and then ferment the pro- 
ducts, and lactose may be sequestered by lactic 
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acid bacteria. Regardless of how it is pro- 
cessed, the by-products of microbial lactose 
usage yield less energy to the host (32). This 
implies that the host and the microbes are in 
competition for lactose and that the host has an 
energetic incentive to outcompete the microbes 
(Fig. 1, A and B). LP hosts outcompete microbes 
for lactose, because lactase is active in the upper 
GIT where microbial biomass is low. LNP hosts, 
by contrast, mainly access the less-energy-dense 
products of microbial lactose metabolism in the 
lower GIT. Undigested lactose passing to the 
colon can thus be considered a form of fiber 
that requires microbial enzymes to process. 
As with fiber degradation in the colon, the 
microbiotas able to degrade lactose derive energy 
from fermentation, and their fermentation 
products [short-chain fatty acids (SCFAs)] con- 
stitute an energy source for the host. Thus, 
although the microbiota takes a share of the 
energy content of lactose, the alternative in a 
LNP host is to lose the energy source entirely. 

Recent studies performed in Western pop- 
ulations have associated the same variant at 
the LCT locus with the relative abundance of 
Bifidobacterium in the gut microbiota. This 
association is the most consistent signal of 
microbiota genome-wide association studies 
(GWASs) to date (33) (Box 2), and the link de- 
pends on milk consumption (34). The inverse 
relationship between LP genotype and relative 
abundances of Bifidobacterium supports the 
notion that the mammalian lactase enzyme 
and bacterial §-galactosidase enzyme are in 
direct competition for lactose. Bifidobacteria 
are important lactose degraders (35, 36) and 
may have helped adult hosts extract energy 
from milk after domestication of dairy-producing 
animals and before selection on the LP alleles. 
After the rise of pastoralism, lactose-metabolizing 
bacteria likely had a positive effect on the fitness 
of hosts in the absence of LP alleles (29). Later, 
the beneficial host allele arose and replaced the 
microbiota function, becoming nearly fixed in 
some populations (23, 35). The production of 
B-galactosidase by Bifidobacterium (or func- 
tionally redundant microbes) may also have 
relaxed the selection pressure on the LP allele 
by reducing the fitness differences between 
LCT genotypes. Additionally, if microbial 
consumption of lactose minimizes the dis- 
ease outcome related to lactose maldiges- 
tion and promotes milk consumption that 
carries additional benefits (36), the micro- 
biota may be mitigating energetic trade- 
offs and maintaining LCT polymorphisms 
in human populations. 


Host genes recruit beneficial microbiota 
functions 


The lactose example suggests that activities 
encoded by both the microbiome and the 
host genome may be particularly susceptible 
to competition between host and microbiota. 
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Fig. 1. Examples of associations between host genotype and the microbiota. The inverse associations 
between LCT genotypes and the relative abundance of Bifidobacterium are described for (A) LNP hosts and 
(B) LP hosts. The positive associations between AMY1 gene copy number (CN) and relative abundance of 
Ruminococcus are described for (C) low-AMY1 CN hosts and (D) high-AMY1 CN hosts. Different types of 


Sugars are shown in different shapes and colors. 


In contrast to lactose, which is a single sugar, 
the variety of starches in the human diet 
provides an opportunity for the host to access 
one form of starch with amylase in the upper 
GIT, leaving resistant starches for the micro- 
biota to digest in the lower GIT. Indeed, 
starches take many different forms, including 
structures recalcitrant to degradation by the 
host enzyme amylase, which cleaves starch into 
glucose subunits. The degree to which host and 
microbiota partition the starch substrates likely 
depends on host genotype. 

Salivary amylase, encoded by the AMYI 
gene, begins the process of starch breakdown 
to glucose in the mouth. Individual differences 
in salivary amylase activity are positively cor- 
related with the copy number (CN) of the 
AMY1 gene, which typically ranges from 2 to 
15 (24, 37). Compared with individuals that 
have low AMYI CNs, those with high AMYI 
CNs more thoroughly deplete a given diet of 
amylase-susceptible starches before these foods 
reach the lower GIT and its microbiota (Fig. 1, 
C and D). A low AMYI CN of 2 is the ancestral 
condition: The expansion of AMY] CN in 
humans was likely driven by a dietary shift 
from a low-starch diet to a high-starch diet 
(24, 38, 39). Indeed, the flanking regions of 
the AMYI genes show evidence of a selec- 
tive sweep in human populations, which 
supports the origin of AMY1 duplications 
after the split from Neanderthals but before 
the origin of agriculture (38, 39). Higher levels 
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of salivary amylase likely promoted fitness of 
individuals in populations with a high-starch 
diet as agriculture developed. Similar to the 
evolution of LP, host amylase conferred a 
fitness advantage by its greater energy yield 
compared with that of microbial fermentation 
products. Hosts with high AMY7 CNs addi- 
tionally benefit from enhanced degradation of 
resistant starch by colonic microbes. 
Compared with the gut microbiotas of indi- 
viduals with low AMYI CNs, those of high- 
AMYI1 CN hosts are enriched in Ruminococcus 
(37), a genus important in resistant-starch fer- 
mentation (40, 41). In low-AMYI1 CN hosts, 
gut bacteria preferentially ferment amylase- 
susceptible starch that hosts cannot use be- 
cause they have low levels of salivary amylase. 
Ruminococcus, or functionally redundant 
microbes, may have strengthened the selec- 
tion pressure on the expansion of AMY] CN by 
amplifying the relative fitness difference 
between individuals with high and low AMYI 
CNs and may have reinforced the host gene- 
microbe association (Fig. 1, C and D). This 
dynamic could thus lead to the recruitment 
of microbes on the basis of host genotype. 


Human adaptations that may involve 
gene-microbe interactions 


There is emerging evidence of microbial in- 
volvement in several other human genetic 
adaptations (Fig. 2) in which the role of mi- 
crobes was previously underappreciated. 
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Adaptation to dietary shifts 

Lactose and starch consumption (33, 37) are 
probably the first known examples of gene- 
microbe interactions related to diet that could 
be investigated further for a role in human 
adaptation, but others are likely to emerge. 
For instance, long-chain polyunsaturated fatty 
acids (LC-PUFAs) are key nutrients that can 
be acquired directly from animal-based diets 
or synthesized from plant-derived precursors 
using host-derived enzymes (42-45). Evidence 
of positive selection on genes related to fatty- 
acid metabolism—including fatty-acid desaturase 
(FADS) (42-45), carnitine palmitoyltransferase 
1A (CPTIA) (46), and pancreatic lipase related 
protein 2 (PLRP2) (47)—has been associated 
with dietary adaptations in multiple human 
populations. The ancestral allele of the FADS 
gene region is associated with populations 
that consume a plant-based diet (44, 45), and 
the derived allele is associated with current 
and historical populations that favor a high-fat 
diet (42). Bacteria belonging to at least 10 phyla 
appear to biosynthesize LC-PUFAs (48). Among 
human gut microbes, Lactobacillus plantarum 
and others have been shown to metabolize the 
plant-derived precursors and affect the bio- 
availability of PUFA-derived metabolites in vitro 
and in vivo (49-51). Human gut microbes that 
are enriched by a plant-based diet (52-54) or 
a high-fat diet (55), respectively, have been 
described (Fig. 2). Similarly, the alcohol de- 
hydrogenase 1B (ADH1B) locus shows evidence 
of positive selection in humans (56, 57) and has 
been associated with increased ethanol intake, 
which is thought to have accompanied the 
development of agriculture (57). Human and 
bacterial enzymes can convert ethanol to 
acetaldehyde; variation in the gut micro- 
biota has been associated with alcohol con- 
sumption and related diseases (58, 59) (Fig. 
2). These observations suggest that during 
shifts from a predominantly plant-based to a 
largely animal-based diet, and toward increased. 
alcohol consumption, members of the micro- 
biota may have played an adaptive role. 


Adaptation to climate-related factors 


Humans have adapted to local climates (60). 
Populations living at high latitudes face cold 
stress, and genes related to temperature sensing 
(TRPMS8) and energy harvest (e.g., HADS and 
CPTIA) have been associated with climate- 
related adaptations (42, 60, 61). The human 
microbiota also varies by latitude and may 
contribute to climate-related adaptation by 
increasing the efficiency of energy extraction 
from the diet and subsequent fat storage 
(62). Some of this variation in the microbiota 
is associated with ambient temperature (63-66) 
and host body size (67, 68) (Fig. 2). Experimental 
studies in mice have shown that changes in 
ambient temperature alone can induce changes 
in the gut microbiota (63-66). The microbiotas 
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Box 2. Microbial heritability in relation to host adaptation. 


As used here, the term “heritability” differs from the terms related to transmission modes of microbiota, 
such as “inheritance. Heritability is a statistical measurement used to estimate the proportion of 
phenotypic variation in a population that is explained by genetic variation (96) (see Box 1 for definitions). 
Heritability has been estimated for the relative abundances of taxa or genes (33, 72). For the human 
gut microbiota, ~10% of gut taxa have been shown to be heritable (105), and similar proportions of 
heritable microbes have been reported in maize root microbiotas (106). Note that microbial heritability 
estimated in twin studies is independent of microbial transmission mode (see Box 1). 

In general, heritability estimates of relative abundances of microbial taxa in the human gut microbiota tend 
to be low compared with estimates for traits such as height (33). Low heritability estimates for microbiota 
traits can be attributed to environmental noise and/or nonadditive genetic variance (107, 108). For example, 
temporary changes in the gut microbiota caused by dietary shifts (40, 41) or pathogen overgrowth (11, 12, 31) 
may lower heritability estimates. Alternatively, strong purifying selection acting on host genetic mechanisms 
that affect the microbiota could lead to fixation of genetic variants and also reduce heritability estimates. 
Indeed, population genetic theories (109) and empirical studies (108, 110, 111) demonstrate that the traits 
most tightly associated with fitness tend to show lower heritability (e.g., fecundity) than traits that are less 
associated with fitness (e.g., morphology). Thus, heritability does not provide the relative contributions of 
genetics and environment to a trait. Despite its limitations, heritability does provide information on how well 
genetics can predict a trait, which then motivates GWASs to identify the genetic basis of heritable microbial 
traits (98). Furthermore, heritability and related measurements (112) have been used to predict responses to 
selection in evolutionary, agricultural, and medical studies (97). Many of the same microbes are heritable 
across several human populations (33) and mammalian species (113). These findings support the value of 
characterizing heritable components of the microbiota for use in studies that investigate the underlying 
genetic and nongenetic mechanisms. 

Heritable components of the microbiota are more likely than nonheritable components to be involved in 
genetic adaptations. In the early stages of adaptation, locally beneficial microbes can be acquired by the host 
population and may not necessitate a change in the host genome to facilitate their maintenance (13-15). 
During host acclimation to the new environment, beneficial microbes would therefore remain nonheritable. In 
the long term, however, the host population may evolve genetic variants to replace or recruit these beneficial 
microbes (Fig. 1), such that nonheritable microbes will become heritable. In principle, when the allele that 
maintains the beneficial microbiota function reaches complete fixation, the heritable component of the 
microbiota will revert to being nonheritable, because every individual in the population will carry the allele. 
However, complete fixation of a locally beneficial allele is unlikely in the face of migration from nonadapted 
populations (114). Thus, heritable microbial measurements provide a starting point to identify components of 
a microbiota that may be involved in ongoing host genetic adaptations to local environments. 


Relationship between heritability and transmission modes 

Transmission modes will likely play a role in the stability of microbial heritability over time. For the 
relative abundances of microbial taxa or genes to be stably heritable over time, they must be present every 
generation and associated with host genetic variation. If acquisition of microbes from the environment or 
other individuals is less reliable than acquisition from parents, the heritability of horizontally acquired 
microbes should be less stable over time than the heritability of vertically acquired microbes. For many 
microbes within the human microbiota, the degree of vertical transmission remains to be characterized, 
but more and more microbial taxa show evidence of parent-to-offspring transmission (100, 101) and 
population-level transmission across many generations (115-117). 


of cold-exposed hosts exhibit higher absorp- 
tive capacity and energy harvest (63), as well 
as elevated heat production by brown adipose 
tissue (63-66). However, whether microbiota- 
mediated thermogenesis and TRPM8 poly- 
morphisms are linked in humans remains to 
be determined. 

Climate is known to be an important selec- 
tive pressure for genes involved in common 
metabolic disorders (47, 60, 69). The most con- 
sistent and highly heritable (Box 1) taxon in the 
human gut, the bacterial family Christensenellaceae, 
is associated with lower body mass index (BMI) (67) 
and metabolic disorders (70). Methanobacteriaceae, 
Dehalobacteriaceae, and members of Tenericutes 
also show heritability in multiple human 
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populations (33) and co-occur with the Chris- 
tensenellaceae (67). Recent evidence points to 
metabolic interactions between these partners 
that may affect metabolite availability to the 
host (71). Despite the significant heritability 
estimates for these taxa, no significant host 
genetic variants have been identified by GWASs. 
This suggests that some clearly heritable taxa 
may have polygenic nature with many genes 
with small effects (72). Akkermansia, a mucin 
degrader, has also been associated with BMI 
(73) and shown as a heritable taxon across 
multiple studies (33). The relative abundance 
of Akkermansia has been associated with a 
host genetic variant in PLDI (74), a gene pre- 
viously associated with BMI (75). PLDI shows 
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evidence of positive selection in African popu- 
lations, but the selective agent is unclear (76). 

Populations living at low latitudes face high 
levels of solar radiation, and genes related to 
skin pigmentation are classic examples of 
local adaptations in humans (25, 60) (Fig. 2). 
Melanin in skin protects from ultraviolet (UV) 
radiation, and its antimicrobial properties have 
been suggested as an alternative hypothesis for 
the latitudinal gradient of skin pigmentation 
(77). Accordingly, an increase in skin pigmen- 
tation in human melanocytes can be induced 
by exposure to microbial products such as 
lipopolysaccharide (77). Guéniche et al. reported. 
a protective effect of Lactobacillus spp. against 
skin and systemic immune responses after UV 
exposure (78). Given the emerging role for 
the gut microbiota in the regulation of skin 
homeostasis (79), further research may help 
to identify the potential interactions between 
pigmentation genes and microbes of the skin 
and gut. 

High-altitude environments also impose many 
physiological challenges, including hypoxic stress 
(26), which may also be altered by the micro- 
biota. A variety of adaptive traits and genes 
related to oxygen respiration and blood circu- 
lation that often involve the regulation of blood 
pressure have been identified in high-altitude 
environments (26) (Fig. 2). Recently, the gut 
microbiota has been shown to regulate blood 
pressure via production of SCFAs (80), and it is 
possible that SCFAs help regulate blood pres- 
sure in high-altitude environments (81-83). 
The production of SCFAs, including butyrate, 
influences the activity of hypoxia-inducible fac- 
tors in mice (84), which is also a key pathway 
involved in high-altitude adaptation in multiple 
human populations (26). Production of bacte- 
rial angiotensin-converting enzymes (ACEs) 
that act on the renin-angiotensin system may 
also alter blood pressure in high-altitude envi- 
ronments (83). Bacterial ACEs have been shown 
to convert mammalian angiotensins in vitro (85). 


Adaptation to local pathogens 


Pathogens are considered one of the main 
drivers of adaptive evolution in humans (86) 
(Fig. 2), and the interface between the micro- 
biota with pathogen defense and immunity 
has been extensively studied (11, 30). A text- 
book example of local adaptation in humans 
is the genetic basis for resistance to malaria, 
caused by several species of parasitic proto- 
zoans in the genus Plasmodium (27). Skin 
and gut microbiotas may influence different 
stages of malaria (87). Genetic variations in 
the same cytokine-related genes, including 
IL10, IL12, TNF, and IFNARI, have been asso- 
ciated with variations in malaria severity (27) and 
microbiota composition (33, 88). Ippolito et al. 
summarized several hypotheses for multiple 
roles the microbiota may play in malaria re- 
sistance: (i) production of volatile compounds 
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by skin bacteria that alter the attraction of 
mosquito vectors, (ii) antigen expression by 
gut microbes that induce antibodies conferring 
protection against Plasmodium transmission, 
and (iii) differences in the microbiota affect- 
ing the severity of blood-stage Plasmodium 
infections (87). 

Other infections that may be influenced by 
the combination of microbiota and host genetic 
variation include cholera, HIV, and helminth 
infection. Cholera toxin targets several ion 
channels with genes that show signatures of 
selection (28, 89). Gut microbial taxa associ- 
ated with susceptibility to, and recovery from, 
cholera have been identified in human popu- 
lations (90, 91). Experimental studies have 
identified bacterial taxa that provide coloni- 
zation resistance (90, 92) and suppression of 
Vibrio cholerae virulence (90). Given that ex- 
pression and function of ion channels are often 
altered in gastrointestinal disorders (93), there 
may be value in exploring links between micro- 
biota and ion channel genes for treatment or 
prevention of diarrheal diseases. The vaginal 
microbiota has been suggested to play a role 
in acquisition and transmission of HIV (94), 
and the gut microbiota has been suggested 
to interact with helminths and alter disease 
outcome (11). Further investigation of inter- 
actions between well-studied host-adaptive 
alleles (22, 95) and emerging beneficial mi- 
crobiota functions may uncover previously 
unknown mechanisms of disease resistance. 

All of the host gene-microbe examples 
mentioned here (Fig. 2) are still at the early 
stages of investigation. But in each of these 
cases, the microbiota has the potential to 
affect host evolution by modifying the adap- 
tive landscape. These microbes likely modified 
the host’s environment, resulting in altered 
selection pressures on host phenotypes. In 
future research on the relationship between 
adaptive host genotypes and adaptive micro- 
biota functions, GWASs, candidate gene ap- 
proaches, or transplant experiments using gene 
knockout models could help to identify host 
genetic mechanisms that maintain beneficial 
microbiota function and to test whether re- 
placement and recruitment of microbiotas are 
general features of adaptation. 


Conclusions 


Host-associated microbiotas have affected the 
ecology and evolution of a variety of organ- 
isms, including humans, yet studying the 
interaction between adaptive host alleles and 
the microbiota remains a challenge. There are 
many opportunities, however, to revisit well- 
established examples of local adaptation in 
humans to search for previously unappreciated 
roles for the microbiota. This will require 
research into human populations and micro- 
biomes across the globe and in their local 
contexts. Genetic studies have taken more and 
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more populations into consideration, as have 
studies that characterize microbiomes. But so 
far, studies that include both host genetics and 
the microbiota have, with few exceptions, been 
performed predominantly in Western popula- 
tions. We need to expand the reach of human 
genetic studies that include a microbiome 
component to a wider range of populations 
and locations. Findings from these types of 
studies will test the ubiquity of current findings 
and characterize more cases of microbiota- 
mediated adaptations, including perhaps some 
of the examples suggested here. The heritabi- 
lity of specific taxa could also be investigated 
further to better characterize underlying genetic 
polymorphisms. Many, but not all, heritable 
taxa are heritable across several human popu- 
lations and even across species. These types of 
patterns could be further explored to under- 
stand the local contexts and selective forces. 
Heritability can apply to microbial functions 
as well as taxa, so extending the traits as- 
sociated with human genetic variation from 
taxa to functions through metagenomics should 
further help to clarify the role of microbiomes 
in human adaptation. We have here focused 
on the evolution of the host (and mostly on 
humans), but microbes also rapidly evolve in 
response to their host environments. A better 
understanding of reciprocal interactions be- 
tween the host genome and microbiota in the 
context of adaptive evolution will elucidate 
how humans and other animals, together with 
their microbiotas, have adapted to a changing 
world. 
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INTRODUCTION: The emergence of three lethal 
coronaviruses in <20 years and the urgency of 
the COVID-19 pandemic have prompted ef- 
forts to develop new therapeutic strategies, in- 
cluding by repurposing existing agents. After 
performing a comparative analysis of the three 
pathogenic human coronaviruses severe acute 
respiratory syndrome coronavirus 1 (SARS- 
CoV-1), SARS-CoV-2, and Middle East respira- 
tory syndrome coronavirus (MERS-CoV), we 
identified shared biology and host-directed 
drug targets to prioritize therapeutics with 
potential for rapid deployment against cur- 
rent and future coronavirus outbreaks. 
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RATIONALE: Expanding on our recent SARS- 
CoV-2 interactome, we mapped the virus-host 
protein-protein interactions for SARS-CoV-1 
and MERS-CoV and assessed the cellular lo- 
calization of each viral protein across the 
three strains. We conducted two genetic 
screens of SARS-CoV-2 interactors to priori- 
tize functionally-relevant host factors and 
structurally characterized one virus-host in- 
teraction. We then tested the clinical rele- 
vance of three more host factors by assessing 
risk in genetic cohorts or observing effective- 
ness of host factor-targeting drugs in real- 
world evidence. 
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Overview of the approaches taken for systemic and functional comparison of pathogenic human 
coronaviruses. (Left) Viral-human protein-protein interaction network mapping, viral protein localization studies, 
and functional genetic screens provide key insights into the shared and individual characteristics of each virus. 
(Right) Structural studies and hypothesis testing in clinical datasets demonstrate the utility of this approach for 
prioritizing therapeutic strategies. Nsp, nonstructural protein; ORF, open reading frame; ER, endoplasmic reticulum. 
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RESULTS: Quantitative comparison of the 389 
interactors of SARS-CoV-2, 366 of SARS-CoV-1, 
and 296 of MERS-CoV highlighted interactions 
with host processes that are conserved across 
all three viruses, including where nonortholo- 
gous proteins from different virus strains seem 
to fill similar roles. We also localized each 
individually-expressed viral protein by micros- 
copy and then raised and validated antisera 
against 14 SARS-CoV-2 proteins to determine 
their localization during infection. 

On the basis of two independent genetic 
perturbation screens, we identified 73 host fac- 
tors that, when depleted, caused significant 
changes in SARS-CoV-2 replication. From this 
list of potential drug targets, we validated the 
biological and clinical relevance of Tom70, 
IL17RA, PGES-2, and SigmaR1. 

A 3-A cryo-electron microscopy structure of 
Tom70, a mitochondrial import receptor, in 
complex with SARS-CoV-2 ORF9b, provides in- 
sight into how ORF9b may modulate the host 
immune response. Using curated genome-wide 
association study data, we found that individ- 
uals with genotypes corresponding to higher 
soluble IL17RA levels in plasma are at decreased. 
risk of COVID-19 hospitalization. 

To demonstrate the value of our data for drug 
repurposing, we identified SARS-CoV-2 patients 
who were prescribed drugs against prioritized 
targets and asked how they fared compared with 
carefully matched patients treated with clinically 
similar drugs that do not inhibit SARS-CoV-2. 
Both indomethacin, an inhibitor of host factor 
PGES-2, and typical antipsychotics, selected for 
their interaction with sigma receptors, showed 
effectiveness against COVID-19 compared with 
celecoxib and atypical antipsychotics, respectively. 


CONCLUSION: By employing an integrative and 
collaborative approach, we identified conserved 
mechanisms across three pathogenic corona- 
virus strains and further investigated potential 
drug targets. This versatile approach is broadly 
applicable to other infectious agents and dis- 
ease areas. 
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The COVID-19 pandemic, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), 
is a grave threat to public health and the global economy. SARS-CoV-2 is closely related to 

the more lethal but less transmissible coronaviruses SARS-CoV-1 and Middle East respiratory 
syndrome coronavirus (MERS-CoV). Here, we have carried out comparative viral-human protein- 
protein interaction and viral protein localization analyses for all three viruses. Subsequent functional 
genetic screening identified host factors that functionally impinge on coronavirus proliferation, 
including Tom/0O, a mitochondrial chaperone protein that interacts with both SARS-CoV-1 and SARS- 
CoV-2 ORF9b, an interaction we structurally characterized using cryo—electron microscopy. 
Combining genetically validated host factors with both COVID-19 patient genetic data and medical 
billing records identified molecular mechanisms and potential drug treatments that merit further 
molecular and clinical study. 
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n the past two decades, three deadly 
human respiratory syndromes associated 
with coronavirus (CoV) infections have em- 
erged: severe acute respiratory syndrome 
(SARS) in 2002, Middle East respiratory 
syndrome (MERS) in 2012, and COVID-19 in 
2019. These three diseases are caused by the 
zoonotic coronaviruses severe acute respiratory 
syndrome coronavirus 1 (SARS-CoV-1), Middle 
East respiratory syndrome coronavirus (MERS- 
CoV), and SARS-CoV-2 (J), respectively. Before 
their emergence, human coronaviruses were 
associated with usually mild respiratory illness. 
To date, SARS-CoV-2 has sickened millions and 
killed more than 1 million people worldwide. 
This unprecedented challenge has prompted 
widespread efforts to develop vaccine and anti- 
viral strategies, including repurposed therapeu- 
tics, which offer the potential for treatments 
with known safety profiles and short develop- 
ment timelines. The successful repurposing of 
the antiviral nucleoside analog Remdesivir (2) 
as well as the host-directed anti-inflammatory 
steroid dexamethasone (3) provide clear proof 
that existing compounds can be crucial tools 
in the fight against COVID-19. Despite these 
promising examples, there is still no curative 
treatment for COVID-19. Additionally, as with 
any virus, the search for effective antiviral 
strategies could be complicated over time by 
the continued evolution of SARS-CoV-2 and 
possible resulting drug resistance (4). 

Current endeavors are appropriately focused 
on SARS-CoV-2 because of the severity and 
urgency of the ongoing pandemic. However, 
the frequency with which highly virulent 
coronavirus strains have emerged highlights 
an additional need to identify promising tar- 
gets for broad coronavirus inhibitors with 
high barriers to resistance mutations and the 
potential for rapid deployment against future 
emerging strains. Although traditional anti- 
virals target viral enzymes that are often sub- 
ject to mutation and thus the development of 
drug resistance, targeting the host proteins 
required for viral replication is a strategy that 
can avoid resistance and lead to therapeutics 
with the potential for broad-spectrum activity 
because families of viruses often exploit com- 
mon cellular pathways and processes. 

Here, we identified shared biology and po- 
tential drug targets among the three highly 
pathogenic human coronavirus strains. We 
expanded on our recently published map of 
virus-host protein interactions for SARS-CoV-2 
(5) and mapped the full interactomes of SARS- 
CoV-1 and MERS-CoV. We investigated the lo- 
calization of viral proteins across strains and 
quantitatively compared the virus-human in- 
teractions for each virus. Using functional 
genetics and structural analysis of selected host- 
dependency factors, we identified drug targets 
and performed real-world analysis on clinical 
data from COVID-19 patients. 
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A cross-coronavirus study of protein function 
A central goal of this study is to understand, 
from a systems level, the conservation of tar- 
get proteins and cellular processes between 
SARS-CoV-2, SARS-CoV-1, and MERS-CoV, 
thereby identifying shared vulnerabilities 
that can be targeted with antiviral therapeu- 
tics. All three strains encode four homologous 
structural proteins (E, M, N, and S) and 16 
nonstructural proteins (Nsps). The latter are 
proteolytically cleaved from a polyprotein 
precursor that is expressed from one large 
open reading frame (ORF), ORFlab (Fig. 1A). 
Additionally, coronaviruses contain a vari- 
able number of accessory factors encoded by 
ORFs. Although the genome organization and 
sequence of ORFlab is mainly conserved be- 
tween the three viruses under study, it di- 
verges markedly in the region encoding the 
accessory factors, especially between MERS- 
CoV and the two SARS coronaviruses (Fig. 
1, A to D, and table S1). These differences in 
conservation of genes and genome organiza- 
tion are linked to differences in host-targeting 
systems that we have studied through large- 
scale protein localization and interaction pro- 
filing (Fig. 1E). Building on our earlier work 
on the interactome of SARS-CoV-2 (5), we id- 
entified the host factors physically interacting 
with each SARS-CoV-1 and MERS-CoV viral 
protein. To this end, structural proteins, ma- 
ture Nsps, and predicted ORF proteins were 
codon optimized, 2xStrep tagged, and cloned 
into a mammalian expression vector (figs. S1 
and S82; see below and Materials and meth- 
ods section). Each protein construct was 
transfected into HEK293T cells and affinity 
purified, and high-confidence interactors were 
identified by mass spectrometry (MS) and 
scored using SAINTexpress (significance anal- 
ysis of interactome) and MiST (mass spectro- 
metry interaction statistics) scoring algorithms 
(6, 7) (table S2 and figs. S3 to S6). Addi- 
tionally, we performed MS analysis on SARS- 
CoV-2 Nsp16, which was not analyzed in our 
earlier work (5) (table S2 and fig. S7). In all, 
we now report 389 high-confidence inter- 
actors for SARS-CoV-2, 366 interactors for 
SARS-CoV-1, and 296 interactors for MERS- 
CoV (table S2). 


Conserved coronavirus proteins often retain 
the same cellular localization 


As protein localization can provide important 
information regarding function, we assessed 
the cellular localization of individually ex- 
pressed coronavirus proteins in addition to 
mapping their interactions (Fig. 2A and Mate- 
rials and methods). Immunofluorescence locali- 
zation analysis of all 2xStrep-tagged SARS-CoV-2, 
SARS-CoV-1, and MERS-CoV proteins high- 
lights similar patterns of localization for most 
shared protein homologs in HeLaM cells (Fig. 
2B), which supports the hypothesis that con- 
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served proteins share functional similarities. A 
notable exception is Nsp13, which appears to 
localize to the cytoplasm for SARS-CoV-2 and 
SARS-CoV-1, but to the mitochondria for MERS- 
CoV (Fig. 2B, figs. S8 to S13, and table S3). To 
assess the localization of SARS-CoV-2 proteins 
in the context of infected cells, we raised anti- 
bodies against 20 SARS-CoV-2 proteins and 
validated them with the individually expressed 
2xStrep-tagged proteins (fig. S14). Using the 
14 antibodies with confirmed specificity, we 
observed that localization of viral proteins in 
infected Caco-2 cells sometimes differed from 
their localization when expressed individually 
(Fig. 2B, fig. S15, and table S3). This likely 
results from recruitment of viral proteins and 
complexes into replication compartments, as 
well as from remodeling of the secretory path- 
way during viral infection. Such differences 
could also result from mislocalization caused 
by protein tagging. For example, the localiza- 
tion of expressed ORF7B does not match the 
known SARS-CoV-1 Golgi localization seen in 
the infection state. For proteins such as Nsp1 
and ORF3a, which are not known to be in- 
volved in viral replication, their localization is 
consistent both when expressed individually 
and in the context of viral infection (Fig. 2, C 
and D). We have compared the localization 
of the expressed viral proteins with the local- 
ization of their interaction partners using a 
cellular compartment gene ontology (GO) en- 
richment analysis (fig. S16). Several examples 
exist where the localization of the viral protein 
is in agreement with the localization of the 
interaction partners, including enrichment of 
the nuclear pore for Nsp9 interactors and 
endoplasmic reticulum (ER) enrichment for 
interactions with ORF8. 

Our localization studies suggest that most 
orthologous proteins have the same localiza- 
tion across the viruses (Fig. 2B). Moreover, 
small changes in localization, as observed for 
some viral proteins across strains, do not co- 
incide with strong changes in virus-host pro- 
tein interactions (Fig. 2E). Overall, these results 
suggest that changes in protein localization, 
as measured by expressed, tagged proteins, 
are not common and therefore are unlikely 
to be a major source of differences in host- 
targeting mechanisms. 


Comparison of host-targeted processes 
identifies conserved mechanisms with 
divergent implementations 


To study the conservation of targeted host 
factors and processes, we first used a cluster- 
ing approach (Materials and methods) to com- 
pare the overlap in protein interactions for 
the three viruses (Fig. 3A). We defined seven 
clusters of virus-host interactions correspond- 
ing to those that are specific to each virus or are 
shared among sets of viruses. The largest pair- 
wise overlap was observed between SARS-CoV-1 
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and SARS-CoV-2 (Fig. 3A), as is expected from 
their closer evolutionary relationship. A func- 
tional enrichment analysis (Fig. 3B and table 
S4) highlighted host processes that are tar- 
geted through interactions conserved across 
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all three viruses, including ribosome biogene- 
sis and regulation of RNA metabolism. Con- 
served interactions between SARS-CoV-1 and 
SARS-CoV-2—but not MERS-CoV—were en- 
riched in endosomal and Golgi vesicle trans- 
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port (Fig. 3B). Despite the small fraction (7.1%) 
of interactions conserved between SARS-CoV-1 
and MERS-CoV—but not SARS-CoV-2—these 
were strongly enriched in translation initiation 
and myosin complex proteins (Fig. 3B). 

We next asked whether the conserved in- 
teractions were specific for certain viral pro- 
teins (Fig. 3C) and found that some proteins 
(M, N, Nsp7, Nsp8, and Nsp13) showed a dis- 
proportionately high fraction of shared inter- 
actions conserved across the three viruses. 
This suggests that the processes targeted by 
these proteins may be more essential and 
more likely to be required for other emerging 
coronaviruses. Such differences in conserva- 
tion of interactions should be encoded, to some 
extent, in the degree of sequence differences. 
Comparing pairs of homologous proteins 
shared between SARS-CoV-2 and SARS-CoV-1 
or MERS-CoV, we observed a significant cor- 
relation between sequence conservation and 
protein-protein interaction (PPI) similarity 
(calculated as Jaccard index) [Fig. 3D; corre- 
lation coefficient (r) = 0.58, P = 0.0001]. This 
shows that the evolution of protein sequences 
strongly determines the divergence in virus- 
host interactions. 

While studying the function of host pro- 
teins interacting with each virus, we noted 
that some shared cellular processes were tar- 
geted by different interactions across the 
viruses. To study this in more detail, we iden- 
tified the cellular processes significantly en- 
riched in the interactomes of all three viruses 
(fig. S17A and table S4) and ranked them by 
the degree of overlapping proteins (Fig. 3E). 
This identified proteins related to the nuclear 
envelope, proteasomal catabolism, cellular re- 
sponse to heat, and regulation of intracellular 
protein transport as biological functions that 
are hijacked by these viruses through differ- 
ent human proteins. Additionally, we found 
that up to 51% of protein interactions with a 
conserved human target occurred via a dif- 
ferent (nonorthologous) viral protein (Fig. 3F), 
and, in some cases, the overlap of interac- 
tions for two nonorthologous virus baits 
was greater than that for the orthologous pair 
(Fig. 3G and fig. S17, B and C). For example, 
several interacting proteins of SARS-CoV-2 
Nsp8 are also targeted by MERS-CoV ORF4a, 
and interactions of MERS-CoV ORF5 share 
interactors with SARS-CoV-2 ORF3a (Fig. 3G). 
In the case of Nsp8, we found some degree 
of structural homology between its C-terminal 
region and a predicted structural model of 
ORF4a (Materials and methods and fig. S17D), 
which is indicative of a possible common in- 
teraction mechanism. 

We find that sequence differences determine 
the degree of changes in virus-host interactions 
and that often the same cellular process can 
be targeted by different viral or host proteins. 
These results suggest a degree of plasticity in 
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localization of Strep-tagged SARS-CoV-2, SARS-CoV-1, and MERS-CoV proteins in HeLaM cells (left) or of 
viral proteins upon SARS-CoV-2 infection in Caco-2 cells (right). (B) Relative localization for all coronavirus 
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the way that these viruses can control a given 
biological process in the host cell. 


Quantitative differential interaction 
scoring identifies interactions conserved 
between coronaviruses 


The identification of virus-host interactions con- 
served across pathogenic coronaviruses pro- 
vides the opportunity to reveal host targets 
that may remain essential for these and other 
emerging coronaviruses. For a quantitative 
comparison of each virus-human interaction 
from viral baits shared by all three viruses, 
we developed a differential interaction score 
(DIS). A DIS is calculated between any pair of 
viruses and is defined as the difference be- 
tween the interaction scores (K) from each 
virus (Fig. 4A, table S5, and Materials and 
methods). This kind of comparative analysis is 
beneficial as it permits the recovery of con- 


Gordon ez al., Science 370, eabe9403 (2020) 


served interactions that may fall just below 
strict cutoffs. For each comparison, a DIS was 
calculated for interactions residing in certain 
clusters as defined in the previous analysis 
(see Fig. 3A). For example, for the SARS-CoV-2 
to MERS-CoV comparison, a DIS was com- 
puted for interactions residing in all clusters 
except cluster 3, where interactions are either 
not found or scores were very low for both 
SARS-CoV-2 and MERS-CoV. A DIS of 0 indi- 
cates that the interaction is confidently shared 
between the two viruses being compared, 
whereas a DIS of +1 or -1 indicates that the 
host-protein interaction is specific for the virus 
listed first or second, respectively. 

In agreement with our previous results (Fig. 
3A), DIS values for the comparison between 
SARS-CoV-2 and SARS-CoV-1 are enriched near 
zero, which indicates a high number of shared 
interactions (Fig. 4B, yellow). By contrast, 


4. December 2020 


comparing interactions from either SARS- 
CoV-1 or SARS-CoV-2 with MERS-CoV resulted 
in DIS values closer to +1, which indicates a 
higher divergence (Fig. 4B, blue and green). 
The breakdown of DISs by homologous viral 
proteins reveals a high similarity of interac- 
tions for proteins N, Nsp8, Nsp7, and Nsp13 
(Fig. 4C), reinforcing the observations made 
by overlapping thresholded interactions (Fig. 3, 
C and D). As the greatest dissimilarity was ob- 
served between the SARS coronaviruses and 
MERS-CoV, we computed a fourth DIS (SARS- 
MERS) by averaging K from SARS-CoV-1 and 
SARS-CoV-2 before calculating the difference 
with MERS-CoV (Fig. 4, B and C, purple). We 
next created a network visualization of the 
SARS-MERS comparison (Fig. 4D), permitting 
an appreciation of SARS-specific (red; DIS near 
+1) versus MERS-specific (blue; DIS near -1) 
interactions as well as those conserved between 
all three coronavirus species (black; DIS near 
0). SARS-specific interactions include DNA 
polymerase a interacting with Nspl, stress 
granule regulators interacting with N protein, 
TLE transcription factors interacting with 
Nsp13, and AP2 clathrin interacting with Nsp10. 
Notable MERS-CoV-specific interactions in- 
clude mammalian target of rapamycin (mTOR) 
and Stat3 interacting with Nsp1; DNA dam- 
age response components p53 (TP53), MRE1I, 
RAD5O, and UBR3S interacting with Nsp14; and 
the activating signal cointegrator 1 (ASC-1) 
complex interacting with Nsp2. Interactions 
shared between all three coronaviruses in- 
clude casein kinase II and RNA processing 
regulators interacting with N protein; inosine 
5'-monophosphate (IMP) dehydrogenase 2 
UMPDH2) interacting with Nsp14; centrosome, 
protein kinase A, and TBK1 interacting with 
Nsp13; and the signal recognition particle, 
7SK small nuclear ribonucleoprotein (snRNP), 
exosome, and ribosome biogenesis components 
interacting with Nsp8 (Fig. 4D). 


Cell-based genetic screens identify SARS-CoV-2 
host-dependency factors 


To identify host factors that are critical for 
infection and therefore potential targets for 
host-directed therapies, we performed genetic 
perturbations of 332 human proteins—331 
previously identified to interact with SARS- 
CoV-2 proteins (5) plus ACE2—and observed 
their effect on infectivity. To ensure a broad 
coverage of potential hits, we carried out two 
screens in different cell lines, investigating 
the effects on infection: small interfering RNA 
(siRNA) knockdowns in A549 cells stably ex- 
pressing ACE2 (A549-ACE2) (Fig. 5A) and 
CRISPR-based knockouts in Caco-2 cells (Fig. 
5B). ACE2 was included as positive control in 
both screens as were nontargeting siRNAs or 
nontargeted Caco-2 cells as negative controls. 
After SARS-CoV-2 infection, effects on virus in- 
fectivity were quantified by real-time quantitative 
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score K). Included are only viral protein baits represented amongst all three viruses 
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Jaccard index 


comparing SARS-CoV-2 and SARS-CoV-1 (blue) or MERS-CoV (red). Interactions for 
PPI overlap are derived from the final thresholded list of interactions per virus. 

(E) GO biological process terms significantly enriched (g < 0.05) for all three virus 
PPIs with Jaccard index indicating overlap of genes from each term for pairwise 
comparisons between SARS-CoV-1 and SARS-CoV-2 (purple), SARS-CoV-1 and 
MERS-CoV (green), and SARS-CoV-2 and MERS-CoV (orange). (F) Fraction of shared 
preys between orthologous (blue) and nonorthologous (red) viral protein baits. 

(G) Heatmap depicting overlap in PPIs (Jaccard index) between each bait from 
SARS-CoV-2 and MERS-CoV. Baits in gray were not assessed, do not exist, or do not 
have high-confidence interactors in the compared virus. Nonorthologous bait 
interactions are highlighted with a red square. GO, gene ontology; PPI, protein- 
protein interaction; SARS2, SARS-CoV-2; SARS1, SARS-CoV-1; MERS, MERS-CoV. 


polymerase chain reaction (RT-qPCR) on cell 
supernatants (siRNA) or by titrating virus- 
containing supernatants on Vero E6 cells 
(CRISPR) (see Materials and methods for de- 
tails). Cells were monitored for viability, and 
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knockdown or editing efficiency was deter- 
mined as described (Materials and methods 
and fig. S18). This revealed that 93% of the 
genes were knocked down at least 50% in the 
A549-ACE2 screen, and 95% of the knock- 
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downs exhibited a <20% decrease in viability. 
In the Caco-2 assay, we observed an editing 
efficiency of at least 80% for 89% of the genes 
tested (Materials and methods and fig. S18). Of 
the 332 human SARS-CoV-2 interactors, the 
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Fig. 4. Comparative differential interaction analysis reveals shared 
virus-host interactions. (A) Flowchart depicting calculation of DIS values using 
the average between the SAINT and MiST scores between every bait (/) and prey 
(j) to derive interaction score (K). The DIS is the difference between the 
interaction scores from each virus. The modified DIS (SARS-MERS) compares 
the average K from SARS-CoV-1 and SARS-CoV-2 to that of MERS-CoV (see 
Materials and methods). Only viral bait proteins shared between all three viruses 
are included. (B) Density histogram of the DISs for all comparisons. (C) Dot 
plot depicting the DISs of interactions from viral bait proteins shared between 
all three viruses, ordered left to right by the mean DIS per viral bait. (D) Virus- 
human PPI map depicting the SARS-MERS comparison [purple in (B) and 

(C)]. The network depicts interactions derived from cluster 2 (all three viruses), 
cluster 4 (SARS-CoV-1 and SARS-CoV-2), and cluster 5 (MERS-CoV only). 


final A549-ACE2 dataset includes 331 gene 
knockdowns, and the Caco-2 dataset includes 
286 gene knockouts, with the difference main- 
ly owing to the removal of essential genes 
(Materials and methods). The readouts from 
both assays were then separately normalized 
using robust g-scores (Materials and methods), 
with negative and positive g-scores indicating 
proviral-dependency factors (perturbation leads 
to decreased infectivity) and antiviral host fac- 
tors with restrictive activity (perturbation leads 
to increased infectivity), respectively. As expected, 
negative controls resulted in neutral g-scores 
(Fig. 5, C and D, and tables S6 and S7). Sim- 
ilarly, perturbations of the positive control 
ACE2 resulted in strongly negative g-scores 
in both assays (Fig. 5, C and D). Overall, the 
g-scores did not exhibit any trends related to 
viability, knockdown efficiency, or editing ef- 
ficiency (fig. S18). With a cutoff of |z| > 2 to 
highlight genes that notably affect SARS-CoV-2 
infectivity when perturbed, 31 and 40 depen- 
dency factors (g < —2) and 3 and 4 factors with 
restrictive activity (g > 2) were identified in 
A549-ACE2 and Caco-2 cells, respectively 
(Fig. 5E). Of particular interest are the host- 
dependency factors for SARS-CoV-2 infection, 
which represent potential targets for drug de- 
velopment and repurposing. For example, non- 
opioid receptor sigma 1 (sigma-1, encoded by 
SIGMARI) was identified as a functional host- 
dependency factor in both cell systems, in ag- 
reement with our previous report of antiviral 
activity for sigma receptor ligands (5). To pro- 
vide a contextual view of the genetics results, 
we generated a network that integrates the 
hits from both cell lines and the PPIs of their 
encoded proteins with SARS-CoV-2, SARS- 
CoV-1, and MERS-CoV proteins (Fig. 5F). Not- 
ably, we observed an enrichment of genetic 
hits that encode proteins interacting with viral 
Nsp7, which has a high degree of interactions 
shared across all three viruses (Fig. 3C). Prost- 
aglandin E synthase 2 (PGES2, encoded by 
PTGES2), for example, is a functional interac- 
tor of Nsp7 from SARS-CoV-1, SARS-CoV-2, and 
MERS-CoV. Other dependency factors were spe- 
cific to SARS-CoV-2, including interleukin-17 
(IL-17) receptor A (LI7ZRA), which interacts with 
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SARS-CoV-2 ORF8. We also identify depen- 
dency factors that are shared interactors be- 
tween SARS-CoV-1 and SARS-CoV-2 such as the 
aforementioned sigma receptor 1 (SIGMARI), 
which interacts with Nsp6, and the mitochon- 
drial import receptor subunit Tom’70 (TOMM70), 
which interacts with ORF9b. We will use these 
interactions to validate virus-host interactions 
(ORF8-IL17RA and ORF9b-Tom70), connect 
our systems biology data to evidence for the 
clinical impact of the host factors we identified 
(ILI7RA), and analyze outcomes of COVID-19 
patients treated with putative host-directed 
drugs against PGES-2 and sigma receptor 1. 


SARS ORF9Qb interacts with Tom70 


ORF9b of SARS-CoV-1 and SARS-CoV-2 were 
found to be localized to mitochondria upon 
overexpression as well as in SARS-CoV-2- 
infected cells. In line with this, the mitochon- 
drial outer membrane protein Tom’70 (encoded 
by TOMM70) is a high-confidence interactor 
of ORF9b in both SARS-CoV-1 and SARS-CoV-2 
interaction maps (Fig. 6A), and it acts as 
a host-dependency factor for SARS-CoV-2 
(Fig. 6B). Tom’70 falls below the scoring thres- 
hold as a putative interactor of MERS-CoV 
Nsp2, a viral protein not associated with mito- 
chondria (table S2). Tom70 is one of the ma- 
jor import receptors in the translocase of the 
outer membrane (TOM) complex that recog- 
nizes and mediates the translocation of mito- 
chondrial preproteins from the cytosol into 
the mitochondria in a chaperone-dependent 
manner (8). Additionally, Tom70 is involved 
in the activation of the mitochondrial antivi- 
ral signaling (MAVS) protein, which leads to 
apoptosis upon viral infection (9, 10). 

To validate the interaction between viral 
proteins and Tom70, we performed a coim- 
munoprecipitation experiment in the pres- 
ence or absence of Strep-tagged ORF9b from 
SARS-CoV-1 and SARS-CoV-2 as well as Strep- 
tagged Nsp2 from all three coronaviruses. 
Endogenous Tom70—but not other translo- 
case proteins of the outer membrane including 
Tom20, Tom22, and Tom40—coprecipitated 
only in the presence of ORF9b but not Nsp2 in 
both HEK293T and A549 cells, which con- 
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Edge color denotes DIS: red indicates interactions specific to SARS-CoV-1 
and SARS-CoV-2 but absent in MERS-CoV; blue indicates interactions specific 
to MERS-CoV but absent from both SARS-CoV-1 and SARS-CoV-2: and 

black indicates interactions shared between all three viruses. Human-human 
interactions (thin dark gray line) and proteins sharing the same protein 
complexes or biological processes (light yellow or light blue highlighting, 
respectively) are shown. Host-host physical interactions, protein complex 
definitions, and biological process groupings are derived from CORUM 
(46), GO (biological process), and manually curated from literature sources. 
Thin dashed gray lines are used to indicate the placement of node labels 
when adjacent node labels would have otherwise been obscured. DIS, 
differential interaction score; SARS2, SARS-CoV-2; SARSI, SARS-CoV-1: 
MERS, MERS-CoV; SARS, both SARS-CoV-1 and SARS-CoV-2. 


firms our affinity purification-mass spectrom- 
etry (AP-MS) data and suggests that ORF9b 
specifically interacts with Tom’70 (Fig. 6C and 
fig. SI9A). Further, upon coexpression in bac- 
terial cells, we were able to copurify the 
ORF9b-Tom70 protein complex, which indi- 
cates a stable complex (Fig. 6D). We found 
that SARS-CoV-1 and SARS-CoV-2 ORF9b ex- 
pressed in HeLaM cells colocalized with Tom70 
(Fig. 6E) and observed that SARS-CoV-1 or 
SARS-CoV-2 ORF9b overexpression led to de- 
creases in Tom70 expression (Fig. 6, E and F). 
Similarly, ORF9b was found to colocalize with 
Tom70 on SARS-CoV-2 infection (Fig. 6G). 
This is in agreement with the known outer 
mitochondrial membrane localization of Tom70 
(11) and ORF9b localization to mitochondria 
upon overexpression and during SARS-CoV-2 
infection (Fig. 2B). We also saw decreases in 
Tom70 expression during SARS-CoV-2 infec- 
tion (Fig. 6G) but did not see pronounced 
changes in expression levels of the mitochon- 
drial protein Tom20 after individual Strep- 
ORF9b expression or upon SARS-CoV-2 infection 
(fig. S19, B and C). 


Cryo-electron microscopy structure of 
ORF9b-Tom70 complex reveals ORF9b interacting 
at the substrate binding site of Tom70 


Tom70, as part of the TOM complex, is in- 
volved in the recognition of mitochondrial 
preproteins from the cytosol (72). To further 
understand the molecular details of ORF9b- 
Tom70 interactions, we obtained a 3-A cryo- 
electron microscopy (cryo-EM) structure of 
the ORF9b-Tom70 complex (Fig. 7A and fig. 
S20). Notably, although purified proteins failed 
to interact upon attempted in vitro complex 
reconstitution, they yielded a stable and pure 
complex when coexpressed in Escherichia coli 
(Fig. 6D). This may be because of the fact that 
ORF9b alone purifies as a dimer (as inferred 
by the apparent molecular weight on size- 
exclusion chromatography) and would need to 
dissociate to interact with Tom’70 on the basis 
of our structure. Tom70 preferentially binds 
preproteins with internal hydrophobic target- 
ing sequences (13). It contains an N-terminal 
transmembrane domain and tetratricopeptide 


7 of 25 


RESEARCH | RESEARCH ARTICLE 




















A si i -Cov- B -CoV-2 infecti 
ae gras z Data collection p lial vet Viability and titration Data collection 
ulm" Virus quantification r ee. - diie 
y | > —> (SARS-CoV-2 RT-qPCR) & ey pales > e o > i >  quantificati on 
a Cell viabilit A titrations (SARS-CoV-2 anti N, 
at y ae, as foci counting) 
= Cells f or, 
Knockdown efficiency viability en —_, ee DAPI stain 
9 (viability) 
-24 h 72h 72h 96 h 
C E 
5— Infectivity in A549-ACE2 cells Infectivity in A549-ACE2 
i and Caco-2 cells 
20 -10 -5 
D AN a cccnnne | @ © d © 
N A549-ACE2 RAB2A — © 
-5 Z-score NGDN~ / k 
SIGMAR1 r © e? OO 
ATP6AP1 \ ° 9.46 
PPT1 
O 
O 
ACE2 4715 
O 
Caco-2 Z-score — -20 






DPH5 





RBM28 


UBAP2L 
PPIL3 PABPC1 


& SARS-CoV-2 Protein 





ATP6AP1 
SIGMAR1 


& SARS-CoV-1 Protein 


Fig. 5. Functional interrogation of SARS-CoV-2 interactors using genetic 
perturbations. (A) A549-ACE2 cells were transfected with siRNA pools 
targeting each of the human genes from the SARS-CoV-2 interactome, followed 

by infection with SARS-CoV-2 and virus quantification using RT-qPCR. Cell viability 
and knockdown efficiency in uninfected cells was determined in parallel. 

(B) Caco-2 cells with CRISPR knockouts (KO) of each human gene from the 
SARS-CoV-2 interactome were infected with SARS-CoV-2, and supernatants were 
serially diluted and plated onto Vero E6 cells for quantification. Viabilities of 
the uninfected CRISPR knockout cells after infection were determined in parallel 
by DAPI staining. (C and D) Plot of results from the infectivity screens in 
A549-ACE2 knockdown cells (C) and Caco-2 knockout cells (D) sorted by z-score 
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(nontargeting control for siRNA, nontargeted cells for CRISPR) and positive 
controls (ACE2 knockdown or knockout) are highlighted. (E) Results from 
both assays with potential hits (|z| > 2) highlighted in red (A549-ACE2), 
yellow (Caco-2), and orange (both). (F) Pan-coronavirus interactome reduced 
to human preys with significant increase (red nodes) or decrease (blue 
nodes) in SARS-CoV2 replication upon knockdown or knockout. Viral proteins 
baits from SARS-CoV-2 (red), SARS-CoV-1 (orange), and MERS-CoV (yellow) 
are represented as diamonds. The thickness of the edge indicates the 
strength of the PPI in spectral counts. KD, knockdown; KO, knockout; PPI, 
protein-protein interaction. 


repeat (TPR) motifs in its cytosolic segment. 
The C-terminal TPR motifs recognize the in- 
ternal mitochondrial targeting signals (MTSs) 
of preproteins, and the N-terminal TPR clamp 
domain serves as a docking site for multi- 
chaperone complexes that contain preprotein 
(14, 15). Obtained cryo-EM density allowed us 
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to build atomic models for residues 109 to 600 
of human Tom’70 and residues 39 to 76 of SARS- 
CoV-2 ORF9b (Fig. 7A and table S8). ORF9b 
makes extensive hydrophobic interactions at 
the pocket on Tom70 that have been impli- 
cated in its binding to MTS, with the total 
buried surface area at the interface being quite 
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extensive—~2000 A? (Fig. 7B). In addition to 
the mostly hydrophobic interface, four salt 
bridges further stabilize the interaction (Fig. 
7C). On interaction with ORF9b, the interact- 
ing helices on Tom70 move inward to tightly 
wrap around ORF9b as compared with previous- 
ly crystallized yeast Tom70 homologs (movie S1). 
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Fig. 6. Interaction between ORF9b and human Tom70. (A) ORF9b-Tom70 
interaction is conserved between SARS-CoV-1 and SARS-CoV-2. (B) Viral titers 
in Caco-2 cells after CRISPR knockout of TOMM70 or controls. (C) Coimmuno- 
precipitation of endogenous Tom70 with Strep-tagged ORF9b from SARS-CoV-1 
and SARS-CoV-2; Nsp2 from SARS-CoV-1, SARS-CoV-2, and MERS-CoV; or vector 
control in HEK293T cells. Representative blots of whole-cell lysates and eluates after 
IP are shown. (D) Size exclusion chromatography traces (10/300 S200 increase) 
of ORF9b alone, Tom/0 alone, and coexpressed ORF9b-Tom/0 complex 
purified from recombinant expression in E. coli. Insert shows SDS-PAGE of the 
complex peak indicating presence of both proteins. (E) Immunostainings for 
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Tom70 in HeLaM cells transfected with GFP-Strep and ORF9b from SARS- 
CoV-1 and SARS-CoV-2 (left) and mean fluorescence intensity + SD values 
of Tom/0 in GFP-Strep and ORF9b expressing cells (normalized to 
nontransfected cells) (right). Scale bar, 10 um. (F) Flag-Tom/0 expression 
levels in total cell lysates of HEK293T cells upon titration of cotransfected 
Strep-ORF9b from SARS-CoV-1 and SARS-CoV-2. (G) Immunostaining for 
ORF9b and Tom/0 in Caco-2 cells infected with SARS-CoV-2 (left) and mean 
fluorescence intensity + SD values of Tom/0O in uninfected and SARS-CoV-2- 
infected cells (right). SARS2, SARS-CoV-2; SARS1, SARS-CoV-1; MERS, 
MERS-CoV; IP, immunoprecipitation. **P < 0.05, Student's t test. 
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Fig. 7. Cryo-EM structure of ORF9b-lom70 complex reveals ORF9b adopting 
a helical fold and binding at the substrate recognition site of Tom/0. 

(A) Surface representation of the ORF9b-Tom/O structure. Tom/O is depicted as 
molecular surface in green, ORF9b is depicted as ribbon in orange. Region in 
charcoal indicates Hsp/70 or Hsp90 binding site on Tom70. (B) Magnified view 
of ORF9b-Tom/0 interactions with interacting hydrophobic residues on Tom/0 
indicated and shown in spheres. The two phosphorylation sites on ORF9b, 

S50 and S53, are shown in yellow. (C) lonic interactions between Tom/0 and ORF9b 
are depicted as sticks. Highly conserved residues on Tom/0 making hydrophobic 
interactions with ORF9b are depicted as spheres. (D) Diagram depicting 
secondary structure comparison of ORF9b as predicted by JPred server—as 





1.0 
40 50 60 70 oe 
PKVYPIILRLGSPLSLNMARKTLNSLEDKAFQLTPIAV = 0.8 6 T 
= 0 e 
j j ÃĂ— i - Q o 
Prediction —=—=® ——— G — E ——— Bog í ° 
o ° 
E A @ 
With Tom70 —————— a y ° e. e, 
E ee © ë 
0.2 è s 8 ° 


As a dimer ąB PD am = l wa °’ i Au 





0 10 20 30 40 50 60 
Residues truncated from N terminus 


visualized in our structure—or as visualized in the previously crystallized dimer 
structure (PDB ID: 6Z4U) (16). Pink tubes indicate helices, charcoal arrows 
indicate beta strands, and the amino acid sequence for the region visualized in 
the cryo-EM structure is shown on top. (E) Predicted probability of having an 
internal MTS as output by TargetP server by serially running N-terminally 
truncated regions of SARS-CoV-2 ORF9b. Region visualized in the cryo-EM 
structure (amino acids 39 to 76) overlaps with the highest internal MTS 
probability region (amino acids 40 to 50). MTS, mitochondrial targeting signal. 
Single-letter abbreviations for the amino acid residues are as follows: A, Ala; C, 
Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; l, Ile; K, Lys; L, Leu; M, Met; N, Asn; 
P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 


No structure for human Tom70 without a sub- 
strate has been reported to date, and therefore 
we cannot rule out the idea that the confor- 
mational differences are because of differences 
between homologs. However, it is possible that 
this conformational change upon substrate 
binding is conserved across homologs, as many 
of the Tom70 residues interacting with ORF9b 
are highly conserved, which likely indicates 
residues essential for endogenous MTS sub- 
strate recognition. 

Although a previously published crystal 
structure of SARS-CoV-2 ORF9b revealed that 
it entirely consists of beta sheets [Protein Data 
Bank (PDB) ID: 6Z4U] (16), we observed that, 
upon binding Tom70 residues 52 to 68, ORF9b 
forms a helix (Fig. 7D). This is consistent with 
the fact that MTS sequences recognized by 
Tom70 are usually helical, and analysis with 
the TargetP MTS prediction server revealed 
a high probability for this region of ORF9b to 
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have an MTS (Fig. 7E). This shows structural 
plasticity in this viral protein where, depend- 
ing on the binding partner, ORF9b changes 
between helical and beta strand folds. Fur- 
thermore, we had previously identified two 
infection-driven phosphorylation sites on 
ORF9b, S50 and S53 (17), which map to the 
region on ORF9b buried deep in the Tom70 
binding pocket (Fig. 7B, yellow). S53 con- 
tributes two hydrogen bonds to the interac- 
tion with Tom70 in this overall hydrophobic 
region. Therefore, once phosphorylated, it is 
likely that the ORF9b-Tom70 interaction is 
weakened. These residues are surface exposed 
in the dimeric structure of ORF9b, which 
could potentially allow phosphorylation to 
partition ORF9b between Tom70-bound and 
dimeric populations. 

The two binding sites on Tom70—the sub- 
strate binding site and the TPR domain that 
recognizes Hsp70 and Hsp90—are known to 
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be conformationally coupled (18). Tom70’s 
interaction with a C-terminal EEVD motif 
of Hsp90 via the TPR domain is key for its 
function in the interferon pathway and induc- 
tion of apoptosis on viral infection (J0, 19). 
Whether ORF9b, by binding to the substrate 
recognition site of Tom70, allosterically inhib- 
its Tom’70’s interaction with Hsp90 at the TPR 
domain remains to be investigated; but not- 
ably we observe in our structure that R192, a 
key residue in the interaction with Hsp70 and 
Hsp90, is moved out of position to interact 
with the EEVD sequence, which suggests that 
ORF9b may modulate interferon and apopto- 
sis signaling via Tom70 (fig. S21). Alternatively, 
Tom70 has been described as an essential 
import receptor for PTEN induced kinase 
1 (PINK1), and therefore the loss of mitochon- 
drial import efficiency as a result of ORF9b 
binding to the Tom’70 substrate binding pocket 
may induce mitophagy. 
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Implications of the ORF8-IL17RA interaction 
for COVID-19 

As described above, we found that IL-17 re- 
ceptor A (ILI7RA) physically interacts with 
ORF8 from SARS-CoV-2, but not SARS-CoV-1 
or MERS-CoV (Fig. 5D, table S2, and Fig. 8A). 
Several recent studies have identified high 
IL-17 levels or aberrant IL-17 signaling as a 
correlate of severe COVID-19 (20-23). We 
demonstrated that the physical interaction 
of SARS-CoV-2 ORF8 with ILI7RA occurs with 
or without IL-17A treatment, which suggests 
that signaling through the receptor does not 
disrupt the interaction with ORF8 (Fig. 8B). 
Furthermore, knockdown of IL17RA led to a 
significant decrease in SARS-CoV-2 viral rep- 
lication in A549-ACE2 cells (Fig. 8C). These 
data suggest that the ORF8-IL17RA interac- 
tion modulates systemic IL-17 signaling. 

One manner in which this signaling is reg- 
ulated is through the release of the extracel- 
lular domain of the receptor as soluble IL17RA 
(sILI7RA), which acts as a decoy in circulation 
by soaking up IL-17A and inhibiting IL-17 sig- 
naling (24). Production of sILI7RA has been 
demonstrated by alternative splicing in cul- 
tured cells (25), but the mechanism by which 
IL17RA is shed in vivo remains unclear (26). 
ADAM family metalloproteases are known 
to mediate the release of other interleukin 
receptors into their soluble form (27). We 
found that SARS-CoV-2 ORF8 physically in- 
teracted with both ADAM9 and ADAMTSI1 in 
our previous study (5). We find that knock- 
down of ADAMS, like that of IL17RA, leads 
to significant decreases in SARS-CoV-2 rep- 
lication in A549-ACE2 cells (Fig. 5D and 
table S2). 

To test the in vivo relevance of sIL17RA in 
modulating SARS-CoV-2 infection, we lever- 
aged a genome-wide association study (GWAS) 
which identified 14 single-nucleotide polymor- 
phisms (SNPs) near the JLI7RA gene that 
causally regulate sIL17RA plasma levels (28). 
We then used generalized summary-based 
Mendelian randomization (GSMR) (28, 29) on 
the curated GWAS datasets of the COVID-19 
Host Genetics Initiative (COVID-HGI) (30) 
and observed that genotypes that predicted 
higher sIL17RA plasma levels were associated. 
with lower risk of COVID-19 when compared 
with the population (Fig. 8D and table S9), 
which is seemingly consistent with our molec- 
ular data. Similar results were obtained when 
comparing only hospitalized COVID-19 patients 
to the population. However, there was no evi- 
dence of association in hospitalized versus non- 
hospitalized COVID-19 patients. Though the 
COVID-HGI dataset is underpowered and this 
observation needs to be replicated in other 
cohorts, the clinical observations, functional 
genetics, and clinical genetics all suggest that 
SARS-CoV-2 benefits from modulating IL-17 
signaling. One potentially contradictory caveat 
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Fig. 8. SARS-CoV-2 ORF8 and functional interactor IL17RA are linked to viral outcomes. (A) IL17RA 


and ADAMS9 are functional interactors of SARS-CoV-2 O 


RF8. Only interactors identified in the genetic 


screening are shown. (B) Coimmunoprecipitation of endogenous IL17RA with Strep-tagged ORF8 or EGFP 
with or without IL-17A treatment at different times. Overexpression was done in HEK293T cells. (C) Viral titer 
after ILI7RA or control knockdown in A549-ACE2 cells. (D) OR of membership in indicated cohorts by 


genetically predicted sIL17RA levels. SARS2, SARS-CoV- 
OR, odds ratio; Cl, confidence interval; sILI7RA, soluble 
(C) indicate SDs; in (D), they indicate 95% Cls. 


is that we find high-level IL-17A treatment 
diminishes SARS-CoV-2 replication in A549- 
ACE2 cells (fig. S22); however, IL-17 is a 
pleiotropic cytokine and it is likely to play 
multiple roles during SARS-CoV-2 infection in 
the context of a competent immune system. 

Infectious and transmissible SARS-CoV-2 vi- 
ruses with large deletions of ORF8 have arisen 
during the pandemic and have been associ- 
ated with milder disease and lower concentra- 
tions of proinflammatory cytokines (37). Notably, 
compared with healthy controls, patients in- 
fected with wildtype, but not ORF8-deleted 
virus, had threefold elevated plasma levels of 
IL-17A (37). More work will be needed to under- 
stand if and how ORF8 manipulates the IL-17 
signaling pathway during the course of SARS- 
CoV-2 infection. 


Investigation of druggable targets identified 
as interactors of multiple coronaviruses 


The identification of druggable host factors 
provides a rationale for drug repurposing ef- 
forts. Given the extent of the current pande- 
mic, real-world data can now be used to study 
the outcome of COVID-19 patients coinci- 
dentally treated with host factor-directed, 
U.S. Food and Drug Administration (FDA)- 
approved therapeutics. Using medical bill- 
ing data, we identified 738,933 patients in the 
United States with documented SARS-CoV-2 
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2; IP, immunoprecipitation; SD, standard deviation; 
ILI7RA. *P < 0.05, unpaired t test. Error bars in 


infection (Materials and methods). In this co- 
hort, we probed the use of drugs against tar- 
gets identified here that were shared across 
coronavirus strains and found to be function- 
ally relevant in the genetic perturbation screens. 
In particular, we analyzed outcomes for an 
inhibitor of prostaglandin E synthase type 2 
(PGES-2, encoded by PTGES2) and for po- 
tential ligands of sigma nonopioid receptor 
1 (sigma-l, encoded by SIGMARI), and inves- 
tigated whether these patients fared better 
than carefully matched patients treated with 
clinically similar drugs without predicted anti- 
coronavirus activity. 

PGES-2, an interactor of Nsp7 from all three 
viruses (Fig. 4D), is a dependency factor for 
SARS-CoV-2 (Fig. 5F). It is inhibited by the 
FDA-approved prescription nonsteroidal anti- 
inflammatory drug (NSAID) indomethacin. 
Computational docking of Nsp7 and PGES-2 
to predict binding configuration showed that 
the dominant cluster of models localizes Nsp7 
adjacent to the PGES-2-indomethacin binding 
site (fig. S23). However, indomethacin did not 
inhibit SARS-CoV-2 in vitro at reasonable anti- 
viral concentrations (fig. S24 and table S10). A 
previous study also found that similarly high 
levels of the drug were needed for inhibition 
of SARS-CoV-1 in vitro, but this study still 
showed efficacy for indomethacin against ca- 
nine coronavirus in vivo (32). This motivated 


11 of 25 


RESEARCH | RESEARCH ARTICLE 


us to observe outcomes in a cohort of out- 
patients with confirmed SARS-CoV-2 infec- 
tion who by happenstance initiated a course 
of indomethacin compared with those who 
initiated the prescription NSAID celecoxib, 


which lacks anti-PGES-2 activity. We com- 
pared the odds of hospitalization by risk-set 
sampling (RSS) patients treated at the same 
time and at similar levels of disease severity 
and then by further matching on propensity 


score (PS) (33) (Fig. 9A and table S11). RSS and 
PS—combined with a new user, active com- 
parator design that mimics the intervention- 
al component of parallel group randomized 
studies—are established design and analytic 


Further matching on propensity 
score (PS) based on 24 risk factors 


Risk set sampling (RSS) with 1:1 
matching of users on 7 key confounders 








New” outpatient users 
of indomethacin 
n = 244 







Individuals with 
documented 






*No use of either drug 
in previous 60 days 


SARS-CoV-2 infection 
n = 738,933 





New” outpatient users 
of celecoxib 
(comparator) 

n= 474 











B Users of Users of n per 
indomethacin celecoxib group ASAMD p-value 

Inpatient stay 1 3 103 0.054 0.34 
non 
oc 
z$ 
D z Any hospital visit 103 0.054 0.21 
© 


Inpatient stay 153 0.092 0.07 


Any hospital visit os 0.092 0.02 


RSS matching 
Only 





Q Q Q Q S Ñ 


N N 


Number of patients requiring 
hospitalization within 30 days 










New* users 
of typical antipsychotics 
n= 1,131 











Individuals hospitalized 
for COVID19 
n = 28,338 











*No use of any antipsychotic 
in previous 60 days 






New* users 
of atypical antipsychotics 
(comparator) 











n = 153 per group 





Propensity Score 


Risk set sampling (RSS) with 1:1 
matching of users on 6 key confounders 
n = 265 per group 





n = 103 per group 


Follow-up for 


hospitalization 








Propensity Score 


Indomethacin better Celecoxib better 


© ` y © y¥ © © O `s” Q y % 
2 o o o> o oo o K 


Odds ratio (95% CI) 


Further matching on propensity 
score (PS) based on 36 risk factors 
n = 186 per group 


Follow-up for 


mechanical 
ventilation 











n = 1,516 i 
Propensity Score Propensity Score 
Users of typical Users of atypical n per , , , Atypical anti- 
antipsychotics antipsychotics group ASAMD p-value Typical antipsychotics better psychotics better 
Risk set sampling and 186 0.053 0.03 
propensity score matching 
Risk set sampling only 
N 
& © = © 2 > Ss ` 3? S S S 


Number of patients requiring 
mechanical ventilation within 30 days 


Fig. 9. Real-world data analysis of drugs identified through molecular 
investigation support their antiviral activity. (A) Schematic of retrospective 
real-world clinical data analysis of indomethacin use for outpatients with SARS- 
CoV-2. Plots show distribution of propensity scores (PSs) for all included patients 
(red, indomethacin users; blue, celecoxib users). For a full list of inclusion, 
exclusion, and matching criteria, see Materials and methods and table S11. 

(B) Effectiveness of indomethacin versus celecoxib in patients with confirmed 
SARS-CoV-2 infection treated in an outpatient setting. Average standardized 
absolute mean difference (ASAMD) is a measure of balance between 
indomethacin and celecoxib groups calculated as the mean of the absolute 
standardized difference for each PS factor (table S11); P value and ORs with 95% 
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Odds ratio (95% Cl) 


Cls are estimated using the Aetion Evidence Platform r4.6. No ASAMD was >0.1. 
(C) Schematic of retrospective real-world clinical data analysis of typical 
antipsychotic use for inpatients with SARS-CoV-2. Plots show distribution of 
PSs for all included patients (red, typical users; blue, atypical users). For a full list 
of inclusion, exclusion, and matching criteria see Materials and methods and 
table S11. (D) Effectiveness of typical versus atypical antipsychotics among 
hospitalized patients with confirmed SARS-CoV-2 infection treated in hospital. 
ASAMD is a measure of balance between typical and atypical groups calculated 
as the mean of the absolute standardized difference for each PS factor (table 
S11); P value and ORs with 95% Cls are estimated using the Aetion Evidence 
Platform r4.6. No ASAMD was >0.1. 
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techniques that mitigate biases that can arise 
in observational studies. A complete list of risk 
factors used for matching, which include demo- 
graphic data, baseline health care utilization, 
comorbidities, and measures of disease sever- 
ity, are found in table S11. 

Among SARS-CoV-2-positive patients, new 
users of indomethacin in the outpatient set- 
ting were less likely than matched new users 
of celecoxib to require hospitalization or in- 
patient services [Fig. 9B; odds ratio (OR) = 
0.33; 95% confidence interval (CI): 0.03 to 
3.19]. The CI of our primary analysis included 
the null value. In sensitivity analyses, neither 
using the larger, risk-set-sampled cohort nor 
relaxing our outcome definition to include any 
hospital visit appreciably changed the inter- 
pretation of our findings, but it did narrow the 
CIs, particularly when both approaches were 
combined (OR = 0.25; 95% CI: 0.08 to 0.76). 
Although we acknowledge that this is a small, 
noninterventional study, it is nonetheless an 
example of how molecular insight can rapidly 
generate testable clinical hypotheses and help 
prioritize candidates for prospective clinical 
trials or future drug development. 

To create larger patient cohorts, we next 
grouped drugs that shared activity against the 
same target—sigma receptors. We previously 
identified sigma-1 and sigma-2 as drug targets 
in our SARS-CoV-2-human PPI map, and mul- 
tiple potent, nonselective sigma ligands were 
among the most promising inhibitors of SARS- 
CoV-2 replication in Vero E6 cells (5). As shown 
above, knockout and knockdown of SIGMARI, 
but not of SIGMAR2 (also known as TMEM97), 
led to robust decreases in SARS-CoV-2 repli- 
cation (fig. S24 and Fig. 5F), which suggests 
that sigma-l1 may be a key therapeutic target. 
We analyzed SIGMARI sequences across 359 
mammals and observed positive selection of 
several residues within beaked whale, mouse, 
and ruminant lineages, which may indicate a 
role in host-pathogen competition (fig. S25). 
Additionally, the sigma ligand drug amiodar- 
one inhibited replication of SARS-CoV-1 as 
well as SARS-CoV-2, consistent with the con- 
servation of the Nsp6-sigma-1 interaction across 
the SARS viruses (fig. S24 and Fig. 4D). We 
then looked for other FDA-approved drugs with 
reported nanomolar affinity for sigma receptors 
or those that fit the sigma ligand chemotype 
(5, 34-41), and we selected 13 such therapeu- 
tics. We find that all are potent inhibitors of 
SARS-CoV-2 with half-maximal inhibitory con- 
centration (ICs) values <10 uM, though there 
is a wide range in reported sigma receptor 
affinity with no clear correlation between 
sigma receptor binding affinity and antiviral 
activity (fig. S24:D). Several clinical drug classes 
were represented by more than one candidate, 
including typical antipsychotics and antihis- 
tamines. Over-the-counter antihistamines are 
not well represented in medical billing data 
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and are therefore poor candidates for real- 
world analysis, but users of typical antipsy- 
chotics can be easily identified in our patient 
cohort. By grouping these individual drug can- 
didates by clinical indication, we were able to 
build a better-powered comparison. 

We constructed a cohort for retrospective 
analysis on new, inpatient users of antipsy- 
chotics. In inpatient settings, typical and atyp- 
ical antipsychotics are used similarly, most 
commonly for delirium. We compared the 
effectiveness of typical antipsychotics, which 
have sigma activity and antiviral effects (fig. 
S24E), versus atypical antipsychotics, which 
do not have antiviral activity (fig. S24F), for 
treatment of COVID-19 (Fig. 9C). Observing 
mechanical ventilation outcomes in inpatient 
cohorts is a proxy for the worsening of severe 
illness rather than the progression from mild 
disease signified by the hospitalization of 
indomethacin-exposed outpatients above. We 
again used RSS plus PS to build a robust, di- 
rectly comparable cohort of inpatients (table 
S11). In our primary analysis, half as many of 
the new users of typical antipsychotics com- 
pared with the new users of atypical antipsy- 
chotics progressed to the point of requiring 
mechanical ventilation, demonstrating signif- 
icantly lower use with an OR of 0.46 (95% CI: 
0.23 to 0.93; P = 0.03; Fig. 9D). As above, we 
conducted a sensitivity analysis in the RSS- 
only cohort and observed the same trend (OR = 
0.56; 95% CI: 0.31 to 1.02; P = 0.06), which 
emphasizes the primary result of a beneficial 
effect for typical versus atypical antipsychotics 
observed in the RSS-plus-PS-matched cohort. 
Although a careful analysis of the relative ben- 
efits and risks of typical antipsychotics should 
be undertaken before considering prospective 
studies or interventions, these data and analy- 
ses demonstrate how molecular information 
can be translated into real-world implications 
for the treatment of COVID-19—an approach 
that can ultimately be applied to other dis- 
eases in the future. 


Discussion 


In this study, we generated and compared 
three different coronavirus-human PPI maps 
in an attempt to identify and understand pan- 
coronavirus molecular mechanisms. The use 
of a quantitative DIS allowed for the identifi- 
cation of virus-specific as well as shared inter- 
actions among distinct coronaviruses. We also 
systematically carried out subcellular local- 
ization analysis using tagged viral proteins 
and antibodies targeting specific SARS-CoV-2 
proteins. Our results suggest that protein lo- 
calization can often differ when comparing 
individually expressed viral proteins with the 
localization of the same protein in the context 
of infection. This can be because of factors 
such as mislocation driven by tagging, changes 
in localization due to interaction partners, or 
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cellular compartments that are specific to the 
infection state. These differences are notable 
caveats of virus-host interaction studies per- 
formed with tagged, expressed proteins. How- 
ever, previous studies and the work performed 
here show how these data can be powerful for 
the identification of host-targeted processes 
and relevant drug targets. 

These data were integrated with genetic 
data where the interactions uncovered with 
SARS-CoV-2 were perturbed using RNA inter- 
ference (RNAi) and CRISPR in different cel- 
lular systems and viral assays—an effort that 
functionally connected many host factors to 
infection. One of these, Tom70, which we have 
shown binds to ORF9b from both SARS-CoV-1 
and SARS-CoV-2, is a mitochondrial outer 
membrane translocase that has been previ- 
ously shown to be important for mounting an 
interferon response (42). Our functional data, 
however, show that Tom’70 has at least some 
role in promoting infection rather than inhib- 
iting it. Using cryo-EM, we obtained a 3-A 
structure of a region of ORF9b binding to the 
active site of Tom’70. Notably, we found that 
ORF9b is in a markedly different conforma- 
tion than previously visualized. This suggests 
the possibility that ORF9b may partition be- 
tween two distinct structural states, with each 
having a different function and possibly ex- 
plaining its apparent pleiotropy. The exact 
details of functional significance and regu- 
lation of the ORF9b-Tom70 interaction will 
require further experimental elucidation. This 
interaction, however, which is conserved be- 
tween SARS-CoV-1 and SARS-CoV-2, could have 
value as a pan-coronavirus therapeutic target. 

Finally, we attempted to connect our in vitro 
molecular data to clinical information availa- 
ble for COVID-19 patients to understand the 
pathophysiology of COVID-19 and explore the- 
rapeutic avenues. To this end, using GWAS 
datasets of the COVID-HGI (30), we observed 
that increased predicted sIL17RA plasma levels 
were associated with lower risk of COVID-19. 
Notably, we find that IL17RA physically binds 
to SARS-CoV-2 ORF8, and genetic disruption 
results in decreased infection. These collec- 
tive data suggest that future studies should 
be focused on this pathway as both an indi- 
cator and therapeutic target for COVID-19. 
Furthermore, using medical billing data, we 
also observed trends in COVID-19 patients 
on specific drugs indicated by our molecular 
studies. For example, inpatients prescribed 
sigma-ligand typical antipsychotics appear 
to have better COVID-19 outcomes compared 
with users of atypical antipsychotics, which 
do not have anti-SARS-CoV-2 activity in vitro. 
However, we cannot be certain that the sigma 
receptor interaction is the mechanism under- 
pinning this effect, as typical antipsychotics 
are known to bind to a multitude of cellular 
targets, and some atypical antipsychotics, 
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which lack anti-SARS-CoV-2 activity, none- 
theless have reported affinity for rodent sigma 
receptors (table S10). Replication in other pa- 
tient cohorts and further work will be needed 
to see whether there is therapeutic value in 
these connections, but we have at least dem- 
onstrated a strategy wherein protein network 
analyses can be used to make testable predic- 
tions from real-world clinical information. 

We have described an integrative and col- 
laborative approach to study and understand 
pathogenic coronavirus infection, identifying 
conserved targeted mechanisms that are likely 
to be of high relevance for other viruses of this 
family, some of which have yet to infect hu- 
mans. We used proteomics, cell biology, viro- 
logy, genetics, structural biology, biochemistry, 
and clinical and genomic information in an 
attempt to provide a holistic view of SARS- 
CoV-2 and other coronaviruses’ interactions 
with infected host cells. We propose that such 
an integrative and collaborative approach could 
and should be used to study other infectious 
agents as well as other disease areas. 


Materials and methods 
Cells 


HEK293T/17 (HEK293T) cells were procured 
from the University of California, San Francisco 
(UCSF) Cell Culture Facility, and are available 
through UCSF's Cell and Genome Engineering 
Core (https://cgec.ucsf.edu/cell-culture-and- 
banking-services). HEK293T cells were cul- 
tured in Dulbecco’s modified Eagle’s medium 
(DMEM) (Corning) supplemented with 10% 
fetal bovine serum (FBS) (Gibco, Life Techno- 
logies) and 1% penicillin-streptomycin (Corn- 
ing) and maintained at 37°C in a humidified 
atmosphere of 5% CO». Short tandem repeat 
(STR) analysis by the Berkeley Cell Culture 
Facility on 8 August 2017 authenticates these 
as HEK293T cells with 94% probability. 

HeLaM cells (RRID: CVCL_R965) were 
originally obtained from the laboratory of 
M.S. Robinson (CIMR, University of Cambridge, 
UK) and have been routinely tested for my- 
coplasma contamination. HeLaM cells were 
grown in DMEM supplemented with 10% FBS, 
100 U/ml penicillin, 100 ug/ml streptomycin, 
and 2 mM glutamine at 37°C in a5% CO, hu- 
midified incubator. 

A549 cells stably expressing ACE2 (A549- 
ACE2) were a gift from O. Schwartz. A549-ACE2 
cells were cultured in DMEM supplemented 
with 10% FBS, blasticidin (20 ug/ml) (Sigma) 
and maintained at 37°C with 5% CO». STR 
analysis by the Berkeley Cell Culture Facility 
on 17 July 2020 authenticates these as A549 
cells with 100% probability. 

Caco-2 cells (ATTC, HTB-37, RRID:CVCL_ 
0025) were cultured in DMEM with GlutaMAxX 
and pyruvate (Gibco, 10569010) and supple- 
mented with 20% FBS (Gibco, 26140079). For 
Caco-2 cells utilized in Cas9-RNP knockouts, 
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STR analysis by the Berkeley Cell Culture Fa- 
cility on 23 April 2020 authenticates these as 
Caco-2 cells with 100% probability. 

Vero E6 cells were purchased from the 
American Type Culture Collection (ATCC) 
and thus authenticated [VERO C1008 (Vero 
76, clone E6, Vero E6)] (ATCC, CRL-1586). 
Vero E6 cells tested negative for mycoplasma 
contamination. Vero E6 cells were cultured 
in DMEM (Corning) supplemented with 
10% FBS (Gibco, Life Technologies) and 1% 
penicillin-streptomycin (Corning) and main- 
tained at 37°C in a humidified atmosphere of 
5% CO». 


Microbes 


LOBSTER E. coli Expression Strain: LOBSTR- 
[BL21(DE3)] Kerafast no. EC1002. 


Antibodies 
Commercially available primary antibodies 
used in this study 


Rabbit anti-beta-actin (Cell Signaling Technol- 
ogy #4967, RRID:AB_ 330288); mouse anti- 
beta tubulin (Sigma-Aldrich #T8328, RRID: 
AB_ 1844090); rabbit anti-BiP (Cell Signaling 
Technology #3177S, RRID:AB_2119845); mouse 
anti-EEA1 (BD Biosciences #610457, RRID:AB __ 
397830, used at 1:200); mouse anti-ERGIC53 
(Enzo Life Sciences #ALX-804-602-C100, RRID: 
AB_2051363, used at 1:200); anti-GM130; rabbit 
anti-GRP78 BiP (Abcam #Ab21685, RRID:AB _ 
2119834); rabbit anti-SARS-CoV-nucleocapsid 
protein (NP) (Rockland #200-401-A50, RRID: 
AB_ 828403); rabbit anti-PDI (protein disul- 
fide isomerase) (Cell Signaling Technology 
#3501, RRID:AB_2156433); mouse anti-Strep 
tag (QIAGEN #34850, RRID:AB_2810987, used 
at 1:5000); mouse anti-strepMAB (IBA Life- 
sciences #2-1507-001, used at 1:1000); rabbit 
anti-Strep-tag II (Abcam #ab232586); rabbit 
anti-Tom20 (Proteintech #11802-1-AP, RRID: 
AB_ 2207530, used at 1:1000); rabbit anti- 
Tom20 (Cell Signaling Technology #42406, 
RRID:AB_2687663); mouse anti-Tom22 (Santa 
Cruz Biotechnology #sc-101286, RRID:AB_ 
1130526); rabbit anti-Tom40 (Santa Cruz Bio- 
technology #sc-11414, RRID:AB_793274:); mouse 
anti-Tom70 (Santa Cruz #sc-390545, RRID: 
AB_ 2714192, used at 1:500); Rabbit anti-STX5 
(Synaptic Systems 110 053, used at 1:500); and 
ActinStaining Kit 647-Phalloidin (Hypernol 
#8817-01, used at 1:400). 


Commercially available secondary 
antibodies used in this study 


Alexa Fluor 488 chicken anti-mouse immuno- 
globulin G (IgG) (Invitrogen #A21200, RRID_ 
AB_ 2535786, used at 1:400); Alexa Fluor 488 
chicken anti-rabbit IgG (Invitrogen #A21441, 
RRID_AB_10563745, used at 1:400); Alexa 
Fluor 568 donkey anti-sheep IgG (Invitrogen 
#A21099, RRID_AB_10055702, used at 1:400); 
Alexa Fluor Plus 488 goat anti-rabbit (Thermo- 
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Fisher A32731, used at 1:500); Alexa Fluor Plus 
594 goat anti-mouse (ThermoFisher A32742, 
used at 1:500); and goat anti-mouse IgG-HRP 
(horseradish peroxidase) (BioRad #170-6516, 
RRID:AB_11125547, used at 1:20000). 


Noncommercial antisera 


Rabbit anti-SARS-CoV-2-NP antiserum was 
produced by the Garcia-Sastre laboratory and 
used at 1:10000. For information on polyclo- 
nal sheep antibodies targeting SARS-CoV- 
2 proteins, see below, table $3, and https:// 
mrcppu-covid.bio/. 


Coronavirus annotation and plasmid cloning 


SARS-CoV-1 isolate Tor2 (NC_004718) and 
MERS-CoV (NC_019843) were downloaded 
from GenBank and utilized to design 2xStrep- 
tagged expression constructs of ORFs and pro- 
teolytically mature Nsps derived from ORFlab 
(with N-terminal methionines and stop co- 
dons added as necessary). Protein termini 
were analyzed for predicted acylation motifs, 
signal peptides, and transmembrane regions, 
and either the N or C terminus was chosen for 
tagging as appropriate. Finally, reading frames 
were codon optimized and cloned into pLVX- 
EFlalpha-I[RES-Puro (Takara/Clontech) includ- 
ing a 5’ Kozak motif. 


Immunofluorescence microscopy of viral 
protein constructs 


Approximately 60,000 HeLaM cells were 
seeded onto glass coverslips in a 12-well dish 
and grown overnight. The cells were trans- 
fected using 0.5 ug of plasmid DNA and 
either polyethylenimine (Polysciences) or Fu- 
gene HD (Promega; 1 part DNA to 3 parts 
transfection reagent) and grown for a fur- 
ther 16 hours. 

Transfected cells were fixed with 4% para- 
formaldehyde (Polysciences) in phosphate- 
buffered saline (PBS) at room temperature 
for 15 min. The fixative was removed and 
quenched using 0.1 M glycine in PBS. The cells 
were permeabilized using 0.1% saponin in PBS 
containing 10% FBS. The cells were stained 
with the indicated primary and secondary anti- 
bodies for 1 hour at room temperature. The 
coverslips were mounted onto microscope 
slides using ProLong Gold antifade reagent 
(ThermoFisher) and imaged using a UplanApo 
60x oil (NA 1.4) immersion objective on a 
Olympus BX61 motorized wide-field epifluor- 
escence microscope. Images were captured 
using a Hamamatsu Orca monochrome cam- 
era and processed using ImageJ. 

To gain insight into the intracellular dis- 
tribution of each Strep-tagged construct, ~100 
cells per transfection were manually scored. 
Each construct was assigned an intracellular 
distribution in relation to the plasma mem- 
brane, ER, Golgi, cytoplasm, and mitochon- 
dria (scored out of 7). Many of the constructs 
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had several localizations so this was also re- 
flected in the scoring. The scoring also took 
into account the impact of expression level on 
the localization of the constructs. 


Meta-analysis of immunofluorescence data 


We first sorted the data concerning viral pro- 
tein location for all Strep-tagged viral proteins 
expressed individually in three heatmaps (one 
per virus) using a custom R script (“pheatmap” 
package). The information concerning protein 
localization during SARS-CoV-2 infection was 
added as a square border color code in the first 
heatmap, to compare the two different local- 
ization patterns. To compare the predicted ver- 
sus the experimentally determined locations, 
for each protein we took the top scoring 
sequence-based localization prediction from 
DeepLoc (43) if the score was >1. When more 
than one localization can be assigned to the 
same protein, we took as many top scoring 
ones as experimentally assigned localizations 
we had for the same protein. Finally, for each 
cell compartment, we count the number of 
experimentally assigned viral proteins and the 
subset of them predicted to that same com- 
partment as correct predictions. To compare 
changes in protein interactions with changes 
in protein localization (Strep-tagged experi- 
ment versus sequence-based prediction), we 
calculated the Jaccard index of prey overlap 
for each viral protein (SARS-CoV-2 versus 
SARS-CoV-1 and SARS-CoV-2 versus MERS- 
CoV) and plotted them together, for proteins 
with the same localization and for proteins 
with different localization. 


Generation of polyclonal sheep antibodies 
targeting SARS-CoV-2 proteins 


Sheep were immunized with individual N- 
terminal glutathione S-transferase (GST)-tagged 
SARS-CoV-2 recombinant proteins or N-terminal 
maltose binding protein (MBP)-tagged pro- 
teins (for SARS-CoV-2 S, S-RBD, and ORF7a), 
followed by up to five booster injections 
4 weeks apart from each other. Sheep were sub- 
sequently bled and IgGs were affinity purified 
using the specific recombinant N-terminal 
MBP-tagged viral proteins. Each antiserum 
specifically recognized the appropriate native 
viral protein. Characterization of each antibody 
can be found at https://mrcppu-covid.bio/. All 
antibodies generated can be requested at 
https://mrceppu-covid.bio/. Also see table S3. 


Immunofluorescence microscopy of infected 
Caco-2 cells 


For infection experiments in human colon 
epithelial Caco-2 cells (ATCC, HTB-37), SARS- 
CoV-2 isolate Muc-IMB-1 (provided by the 
Bundeswehr Institute of Microbiology, Munich, 
Germany) was used. SARS-CoV-2 was propa- 
gated in Vero E6 cells in DMEM supplemented 
with 2% FBS. All work involving live SARS- 
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CoV-2 was performed in the BSL3 facility of 
the Institute of Virology, University Hospital 
Freiburg, and was approved according to the 
German Act of Genetic Engineering by the lo- 
cal authority (Regierungspraesidium Tuebingen, 
permit UNI.FRK.05.16/05). 

Caco-2 human colon epithelial cells seeded 
on glass coverslips were infected with SARS- 
CoV-2 {Strain Muc-IMB-1/2020, second pas- 
sage on Vero E6 cells [2 x 10° plaque-forming 
units (PFU)/ml]} at a multiplicity of infection 
(MOI) of 0.1. At 24 hours postinfection, cells 
were washed with PBS and fixed in 4% para- 
formaldehyde in PBS for 20 min at room 
temperature, followed by 5 min of quenching 
in 0.1 M glycine in PBS at room temperature. 
Cells were permeabilized and blocked in 0.1% 
saponin in PBS supplemented with 10% FBS 
for 45 min at room temperature and incubated 
with primary antibodies for 1 hour at room 
temperature. After washing 15 min with block- 
ing solution, AF568-labeled donkey-anti-sheep 
(Invitrogen, #421099; 1:400) secondary anti- 
body as well as AF4647-labeled Phalloidin 
(Hypermol, #8817-01; 1:400) were applied 
for 1 hour at room temperature. Subsequent 
washing was followed by embedding in Diam- 
ond Antifade Mountant with 4’,6-diamidino- 
2-phenylindole (DAPI) (ThermoFisher, #P36971). 
Fluorescence images were generated using a 
LSM800 confocal laser-scanning microscope 
(Zeiss) equipped with a 63X, 1.4 NA oil objec- 
tive and Airyscan detector and the Zen blue 
software (Zeiss) and processed with Zen blue 
software and ImageJ/Fiji. 


Transfection and cell harvest for 
immunoprecipitation experiments 


For each affinity purification [SARS-CoV-1 
baits, MERS-CoV baits, green fluorescent pro- 
tein (GFP)-2xStrep, or empty vector controls], 
10 million HEK293T cells were transfected 
with up to 15 ug of individual expression con- 
structs using PolyJet transfection reagent 
(SignaGen Laboratories) at a 1:3 ug:ul ratio 
of plasmid to transfection reagent on the 
basis of the manufacturer’s protocol. After 
>38 hours, cells were dissociated at room 
temperature using 10 ml PBS without calcium 
and magnesium (D-PBS) with 10 mM ethyl- 
enediaminetetraacetic acid (EDTA) for at least 
5 min, pelleted by centrifugation at 200 x g, at 
4°C for 5 min, washed with 10 ml D-PBS, pel- 
leted once more, and frozen on dry ice before 
storage at —80°C for later immunoprecipitation 
analysis. For each bait, three independent 
biological replicates were prepared. 

Whole-cell lysates were resolved on 4 to 20% 
Criterion SDS-polyacrylamide gel electropho- 
resis (SDS-PAGE) gels (Bio-Rad Laboratories) 
to assess Strep-tagged protein expression by 
immunoblotting using mouse anti-Strep tag 
antibody 34850 (QIAGEN) and anti-mouse 
HRP secondary antibody (BioRad). 
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Anti-Strep tag affinity purification 

Frozen cell pellets were thawed on ice for 15 
to 20 min and suspended in 1 ml lysis buffer 
[immunoprecipitation (IP) buffer (50 mM tris- 
HCl, pH 7.4 at 4°C; 150 mM NaCl, 1 mM 
EDTA) supplemented with 0.5% Nonidet P 40 
Substitute (NP-40; Fluka Analytical) and cCOm- 
plete mini EDTA-free protease and PhosSTOP 
phosphatase inhibitor cocktails (Roche)]. Sam- 
ples were then freeze-fractured by refreezing 
on dry ice for 10 to 20 min, then rethawed and 
incubated on a tube rotator for 30 min at 
4°C. Debris was pelleted by centrifugation 
at 13,000 x g, at 4°C for 15 min. Up to 56 sam- 
ples were arrayed into a 96-well Deepwell 
plate for affinity purification on the KingFisher 
Flex Purification System (Thermo Scientific) 
as follows: MagStrep “type3” beads (30 ul; IBA 
Lifesciences) were equilibrated twice with 1 ml 
wash buffer (IP buffer supplemented with 
0.05% NP-40) and incubated with 0.95 ml 
lysate for 2 hours. Beads were washed three 
times with 1 ml wash buffer and then once 
with 1 ml IP buffer. Beads were released into 
75 ul denaturation-reduction buffer [2 M urea, 
50 mM Tris-HCl pH 8.0, 1 mM dithiothreitol 
(DTT)] in advance of on-bead digestion. All 
automated protocol steps were performed at 
4°C using the slow mix speed and the follow- 
ing mix times: 30 s for equilibration and wash 
steps, 2 hours for binding, and 1 min for final 
bead release. Three 10-s bead collection times 
were used between all steps. 


On-bead digestion for affinity purification 


Bead-bound proteins were denatured and re- 
duced at 37°C for 30 min, alkylated in the dark 
with 3 mM iodoacetamide for 45 min at room 
temperature, and quenched with 3 mM DTT 
for 10 min. To offset evaporation, 22.5 ul 50 mM 
Tris-HCl, pH 8.0 were added before trypsin 
digestion. Proteins were then incubated at 
37°C, initially for 4 hours with 1.5 ul trypsin 
(0.5 ug/ul; Promega) and then another 1 to 
2 hours with 0.5 ul additional trypsin. All 
steps were performed with constant shaking 
at 1100 rpm on a ThermoMixer C incubator. 
Resulting peptides were combined with 50 ul 
50 mM Tris-HCl, pH 8.0 used to rinse beads 
and acidified with trifluoroacetic acid (0.5% 
final, pH < 2.0). Acidified peptides were de- 
salted for MS analysis using a BioPureSPE 
Mini 96-Well Plate (20 mg PROTO 300 C18; 
The Nest Group, Inc.) according to standard 
protocols. 


MS operation and peptide search 


Samples were resuspended in 4% formic acid, 
2% acetonitrile solution, and separated by a 
reversed-phase gradient over a nanoflow C18 
column (Dr. Maisch). HPLC buffer A was 
composed of 0.1% formic acid, and HPLC 
buffer B was composed of 80% acetonitrile 
in 0.1% formic acid. Peptides were eluted by 
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a linear gradient from 7 to 36% B over the 
course of 52 min, after which the column 
was washed with 95% B and re-equilibrated 
at 2% B. Each sample was directly injected 
by means of an Easy-nLC 1200 (Thermo Fisher 
Scientific) into a Q-Exactive Plus mass spec- 
trometer (Thermo Fisher Scientific) and analy- 
zed with a 75-min acquisition, with all MS1 
and MS2 spectra collected in the orbitrap; 
data were acquired using the Thermo soft- 
ware Xcalibur (4.2.47) and Tune (2.11 QF1 
Build 3006). For all acquisitions, QCloud was 
used to control instrument longitudinal per- 
formance during the project (44). All proteomic 
data were searched against the human pro- 
teome (uniprot reviewed sequences downloaded 
28 February 2020), enhanced green fluorescent 
protein (EGFP) sequence, and the SARS-CoV 
or MERS protein sequences using the default 
settings for MaxQuant (version 1.6.12.0) (45). 
Detected peptides and proteins were filtered 
to 1% false discovery rate in MaxQuant. All 
MS raw data and search results files have been 
deposited to the ProteomeXchange Consortium 
via the PRIDE partner repository with the 
dataset (PXD identifier PXD021588). 


High-confidence protein interaction scoring 


Identified proteins were then subjected to PPI 
scoring with both SAINTexpress (version 3.6.3) 
and MiST (https://github.com/kroganlab/mist) 
(6, 7). We applied a two-step filtering strategy 
to determine the final list of reported inter- 
actors, which relied on two different scoring 
stringency cut-offs. In the first step, we chose 
all protein interactions that had a MiST score 
>0.7, a SAINTexpress Bayesian false-discovery 
rate (BFDR) <0.05, and an average spectral 
count =2. For all proteins that fulfilled these 
criteria, we extracted information about the 
stable protein complexes that they partici- 
pated in from the CORUM (46) database of 
known protein complexes. In the second step, 
we then relaxed the stringency and recovered 
additional interactors that (i) formed com- 
plexes with interactors determined in filtering 
step 1 and (ii) fulfilled the following criteria: 
MiST score =0.6, SAINTexpress BFDR <0.05, 
and average spectral counts =2. Proteins that 
fulfilled filtering criteria in either step 1 or 
step 2 were considered to be high-confidence 
protein-protein interactions (HC-PPIs). 

Using this filtering criteria, nearly all of our 
baits recovered a number of HC-PPIs in close 
alignment with previous datasets reporting 
an average of around six PPIs per bait (47). 
However, for a subset of baits, we observed 
a much higher number of PPIs that passed 
these filtering criteria. For these baits, the 
MiST scoring was instead performed using 
a larger in-house database of 87 baits that 
were prepared and processed in an analogous 
manner to this SARS-CoV-2 dataset. This was 
done to provide a more comprehensive col- 
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lection of baits for comparison, to minimize 
the classification of nonspecifically binding 
background proteins as HC-PPIs. This was 
performed for SARS-CoV-1 baits (M, Nsp12, 
Nsp13, Nsp8, and ORF7b), MERS-CoV baits 
(Nsp13, Nsp2, and ORF4a), and SARS-CoV-2 
Nsp16. SARS-CoV-2 Nsp16 MiST was scored 
using the in-house database as well as all 
previous SARS-CoV-2 data (5). 


Hierarchical clustering of virus-human 
protein interactions 


Hierarchical clustering was performed on 
interactions for (i) viral bait proteins shared 
across all three viruses and (ii) passed the high- 
confidence scoring criteria (MiST score =0.6, 
SAINTexpress BFDR <0.05, and average spec- 
tral counts =2) in at least one virus. We clus- 
tered using a new interaction score (K), which 
we defined as the average between the MiST 
and SAINT score for each virus-human inter- 
action. This was done to provide a single score 
that captured the benefits from each scoring 
method. Clustering was performed using the 
ComplexHeatmap package in R, using the 
“average” clustering method and “euclidean” 
distance metric. K-means clustering (k = 
was applied to capture all possible combina- 
tions of interaction patterns between viruses. 


GO enrichment analysis on clusters 


Sets of genes found in seven clusters were 
tested for enrichment of GO terms, which 
was performed using the enricher function 
of clusterProfiler package in R (48). The GO 
terms were obtained from the C5 collection 
of Molecular Signature Database (MSigDBv7.1) 
and include biological process, cellular com- 
ponent, and molecular function ontologies. 
Significant GO terms were identified (adjusted 
P < 0.05) and further refined to select non- 
redundant terms. To select nonredundant 
gene sets, we first constructed a GO term tree 
based on distances (1 — Jaccard similarity co- 
efficients of shared genes) between the signif- 
icant terms. The GO term tree was cut at a 
specific level (A = 0.99) to identify clusters of 
nonredundant gene sets. For results with mul- 
tiple significant terms belonging to the same 
cluster, we selected the term with the lowest 
adjusted P value. 


Sequence similarity analysis 


Protein sequence similarity was assessed by 
comparing the protein sequences from SARS- 
CoV-1 and MERS-CoV to SARS-CoV-2 for 
orthologous viral bait proteins. The corre- 
sponding PPI similarity was represented by 
a Jaccard index, using the high-confidence 
interactomes for each virus. 


GO enrichment and PPI similarity analysis 


The high-confidence interactors of the three 
viruses were tested for enrichment of GO terms 
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as described above. We then identified GO 
terms that are significantly enriched (adjusted 
P value < 0.05) in all three viruses. For each 
enriched term, we generated the list of its asso- 
ciated genes and computed the Jaccard index 
of pairwise comparisons of the three viruses. 


Orthologous versus nonorthologous 
interactions analysis 


For a given pair of viruses, we identified all 
pairs of baits that share interactors and cat- 
egorized these into orthologous and nonortho- 
logous groups on the basis of whether the two 
baits were orthologs or not. We then summed 
up the total number of shared interactors in 
each group to calculate the corresponding 
fractions. This was performed for all pairwise 
combinations of the three viruses. 


Structural modeling and comparison of 
MERS-CoV ORF4a and SARS-CoV-2 Nsp8& 


To obtain a sensitive sequence comparison 
between MERS-CoV ORF4a and SARS-CoV-2 
Nsp8, we took into consideration their homo- 
logs. We first searched for homologs of these 
proteins in the UniRef30 database using hhblits 
(1 iteration, E-value cutoff 1 x 107°) (49). Sub- 
sequently, the resulting alignments were fil- 
tered to include only sequences with at least 
80% coverage to the corresponding query se- 
quence, and hidden Markov models (HMMs) 
were created using hhmake. Finally, the HMMs 
of ORF4a and Nsp8 homologs were locally 
aligned using hhalign. The structure of ORF4a 
was predicted de novo using trRosetta (50). To 
provide greater coverage than that provided 
by experimental structures, SARS-CoV-2 Nsp8& 
was modeled using the structure of its SARS- 
CoV homolog as template (PDB ID: 2AHM) 
(5D) using SWISS-MODEL (52). To search for 
local structural similarities between ORF4a 
and Nsp8, we used Geometricus, a structure 
embedding tool based on three-dimensional 
(3D) rotation invariant moments (53). This 
generates so-called shape-mers, analogous 
to sequence k-mers. The structures were frag- 
mented into overlapping k-mers on the basis 
of the sequence (& = 20) and into overlapping 
spheres surrounding each residue (radius = 
15 A). To ensure that the similarities found 
between these distinct structures were sig- 
nificant, we used a high resolution of 7 to 
define the shape-mers. This resulted in the 
identification of four different shape-mers 
common to ORF4a and Nsp8. We aligned the 
entire ORF4a structure with residues 96 to 191 
of the Nsp8 structure (i.e., after removal of 
the long N-terminal helix) using the Caretta 
structural alignment algorithm (54), using 
3D rotation invariant moments (53) for ini- 
tial superposition. We optimized parameters 
to maximize the Caretta score. The resulting 
alignment used & = 30, radius = 16 A, gap open 
penalty = 0.05, gap extend penalty = 0.005, 
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and had a root mean square deviation (RMSD) 
of 7.6 A across 66 aligning residues. 


DIS analysis 


We computed a DIS for interactions that (i) 
originated from viral bait proteins shared across 
all three viruses and (ii) passed the high- 
confidence scoring criteria (See above) in at 
least one virus. We defined the DIS to be the 
difference between the interaction scores (K) 
from each virus. A DIS near O indicates that 
the interaction is confidently shared between 
the two viruses being compared, whereas a 
DIS near -1 or +1 indicates that the host- 
protein interaction is specific for one virus or 
the other. We computed a fourth DIS (SARS- 
MERS) by averaging K from SARS-CoV-1 and 
SARS-CoV-2 before calculating the difference 
with MERS-CoV. Here, a DIS near +1 indicates 
SARS-specific interactions (shared between 
SARS-CoV-1 and SARS-CoV-2 but absent in 
MERS-CoV), a DIS near -1 indicates MERS- 
specific interactions (present in MERS-CoV 
and absent or lowly confident in both SARS- 
CoVs), and a DIS near O indicates interactions 
shared between all three viruses. 

For each pairwise virus comparison, as well 
as the SARS-MERS comparison, the DIS was 
defined on the basis of cluster membership of 
interactions (Fig. 3A). For the SARS2-SARS1 
comparison, interactions from every cluster 
except 5 were used, as those interactions 
are considered absent from both SARS-CoV-2 
and SARS-CoV-1. For the SARS2-MERS com- 
parison, interactions from all clusters except 
3 were used. For the SARSI-MERS comparison, 
interactions from all clusters except 6 were 
used. For the SARS-MERS comparison, only 
interactions from clusters 2, 4, and 5 were used. 


Network generation and visualization 


PPI networks were generated in Cytoscape (55) 
and subsequently annotated using Adobe Illus- 
trator. Host-host physical interactions, protein 
complex definitions, and biological process 
groupings were derived from CORUM (46), 
GO (biological process), and manually curated 
from literature sources. All networks were 
deposited in NDEx (56). 


siRNA library and transfection in A549-ACE2 cells 


An OnTargetPlus siRNA SMARTpool library 
(Horizon Discovery) was purchased targeting 
331 of the 332 human proteins previously 
identified to bind SARS-CoV-2 (5) (PDE4DIP 
was not available for purchase and was ex- 
cluded from the assay). This library was ar- 
rayed in a 96-well format, with each plate 
also including two nontargeting siRNAs and 
one siRNA pool targeting ACE2 (table S12). 
The siRNA library was transfected into A549 
cells stably expressing ACE2 (A549-ACE2, pro- 
vided by O. Schwartz), using Lipofectamine 
RNAiIMAX reagent (Thermo Fisher). Briefly, 
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6 pmol of each siRNA pool were mixed with 
0.25 ul RNAiMAX transfection reagent and 
OptiMEM (Thermo Fisher) in a total volume 
of 20 ul. After a 5 min incubation period, the 
transfection mix was added to cells seeded in 
a 96-well format. Twenty-four hours after trans- 
fection, the cells were subjected to SARS-CoV-2 
infection, as described in the section Viral 
infection and quantification assay in A549- 
ACE2 cells, or incubated for 72 hours to assess 
cell viability using the CellTiter-Glo lumines- 
cent viability assay according to the manufac- 
turer’s protocol (Promega). Luminescence was 
measured in a Tecan Infinity 2000 plate reader, 
and percentage viability calculated relative to 
untreated cells (00% viability) and cells lysed 
with 20% ethanol or 4% formalin (0% viabil- 
ity), included in each experiment. 


Viral infection and quantification assay 
in A549-ACE2 cells 


Cells seeded in a 96-well format were inocu- 
lated with a SARS-CoV-2 stock (BetaCoV/ 
France/IDF0372/2020 strain, generated and 
propagated once in Vero E6 cells and a gift 
from the National Reference Centre for Res- 
piratory Viruses at Institut Pasteur, Paris, orig- 
inally supplied through the European Virus 
Archive goes Global platform) at a MOI of 0.1 
PFU per cell. After a 1-hour incubation period 
at 37°C, the virus inoculum was removed, and 
replaced by DMEM containing 2% FBS (Gibco, 
Thermo Fisher). Seventy-two hours postinfec- 
tion, the cell culture supernatant was collec- 
ted, heat inactivated at 95°C for 5 min, and 
used for RT-qPCR analysis to quantify viral 
genomes present in the supernatant. Briefly, 
SARS-CoV-2-specific primers targeting the 
N gene region: 5’-TAATCAGACAAGGAACT- 
GATTA-3' (forward) and 5'-CGAAGGTGTGA- 
CTTCCATG-3' (reverse) (57) were used with 
the Luna Universal One-Step RT-qPCR Kit 
(New England Biolabs) in an Applied Bio- 
systems QuantStudio 6 thermocycler, with the 
following cycling conditions: 55°C for 10 min, 
95°C for 1 min, and 40 cycles of 95°C for 10 s, 
followed by 60°C for 1 min. The number of 
viral genomes is expressed as PFU equivalents 
per milliliter, and was calculated by perform- 
ing a standard curve with RNA derived from a 
viral stock with a known viral titer. 


Knockdown validation with RT-qPCR 
in A549-ACE2 cells 


Gene-specific qPCR primers targeting all genes 
represented in the OnTargetPlus library were 
purchased and arrayed in a 96-well format 
identical to that of the siRNA library (IDT; 
table S13). A549-ACE2 cells treated with siRNA 
were lysed using the Luna Cell Ready Lysis 
Module (New England Biolabs) following the 
manufacturer’s protocol. The lysate was used 
directly for gene quantification by RT-qPCR 
with the Luna Universal One-Step RT-qPCR 
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Kit (New England Biolabs), using the gene- 
specific PCR primers and glyceraldehyde- 
3-phosphate dehydrogenase (GAPDH) as a 
housekeeping gene. The following cycling 
conditions were used in an Applied Biosystems 
QuantStudio 6 thermocycler: 55°C for 10 min, 
95°C for 1 min, and 40 cycles of 95°C for 10 s, 
followed by 60°C for 1 min. The fold change in 
gene expression for each gene was derived 
using the 2-“““'_, 2 (Delta Delta CT) method 
(58), normalized to the constitutively expressed. 
housekeeping gene GAPDH. Relative changes 
were generated comparing the control siRNA 
knockdown transfected cells to the cells trans- 
fected with each siRNA. 


Single guide RNA selection for Cas9 knockout screen 


Single guide RNAs (sgRNAs) were designed 
according to Synthego’s multiguide gene knock- 
out (59). Briefly, two or three sgRNAsS are 
bioinformatically designed to work in a co- 
operative manner to generate small, knockout- 
causing, fragment deletions in early exons (fig. 
S18). These fragment deletions are larger than 
standard indels generated from single guides. 
The genomic repair patterns from a multiguide 
approach are highly predictable on the basis of 
the guide spacing and design constraints to 
limit off-targets, resulting in a higher proba- 
bility protein knockout phenotype (table S14). 


sgRNA synthesis for Cas9 knockout screen 


RNA oligonucleotides were chemically synthe- 
sized on Synthego solid-phase synthesis plat- 
form, using CPG solid support containing a 
universal linker. 5-benzylthio-1H-tetrazole (BTT, 
0.25 M solution in acetonitrile) was used for 
coupling, [3-((dimethylamino-methylidene)amino)}- 
3H-1,2,4-dithiazole-3-thione (DDTT, 0.1 M so- 
lution in pyridine)] was used for thiolation, 
dichloroacetic acid (DCA, 3% solution in to- 
luene) was used for detritylation. Modified 
sgRNA were chemically synthesized to contain 
2'-O-methyl analogs and 3’ phosphorothioate 
nucleotide interlinkages in the terminal three 
nucleotides at both 5’ and 3’ ends of the RNA 
molecule. After synthesis, oligonucleotides were 
subject to a series of deprotection steps, fol- 
lowed by purification by solid-phase extrac- 
tion (SPE). Purified oligonucleotides were 
analyzed by electrospray ionization mass spec- 
trometry (ESI-MS). 


Arrayed knockout generation with Cas9-RNPs 


For Caco-2 transfection, 10 pmol Streptococcus 
Pyogenes NLS-Sp.Cas9-NLS (SpCas9) nuclease 
(Aldevron; 9212) was combined with 30 pmol 
total synthetic sgRNA (10 pmol each sgRNA, 
Synthego) to form ribonucleoproteins (RNPs) 
in 20 ul total volume with SF Buffer (Lonza 
V5SC-2002) and allowed to complex at room 
temperature for 10 min. 

All cells were dissociated into single cells 
using TrypLE Express (Gibco), resuspended in 
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culture media and counted. 100,000 cells per 
nucleofection reaction were pelleted by cen- 
trifugation at 200 x g for 5 min. After centrifu- 
gation, cells were resuspended in transfection 
buffer according to cell type and diluted to 2 x 
10* cells/ul. Five ul of cell solution was added 
to preformed RNP solution and gently mixed. 
Nucleofections were performed on a Lonza HT 
384-well nucleofector system (Lonza, #AAU- 
1001) using program CM-150 for Caco-2. Im- 
mediately after nucleofection, each reaction 
was transferred to a tissue-culture treated 96- 
well plate containing 100 ul of normal culture 
media and seeded at a density of 50,000 cells 
per well. Transfected cells were incubated 
following standard protocols. 


Quantification of arrayed knockout efficiency 


Two days after nucleofection, genomic DNA 
was extracted from cells using DNA Quick- 
Extract (Lucigen, #QE09050). Briefly, cells were 
lysed by removal of the spent media followed 
by addition of 40 ul of QuickExtract solution to 
each well. Once the QuickExtract DNA Extrac- 
tion Solution was added, the cells were scraped 
off the plate into the buffer. After transfer to 
compatible plates, DNA extract was then in- 
cubated at 68°C for 15 min followed by 95°C for 
10 min in a thermocycler before being stored 
for downstream analysis. 

Amplicons for indel analysis were generated. 
by PCR amplification with NEBNext polymer- 
ase (NEB, #M0541) or AmpliTaq Gold 360 poly- 
merase (Thermo Fisher Scientific, #4398881) 
according to the manufacturer’s protocol. The 
primers were designed to create amplicons 
between 400 and 800 base pairs (bp), with both 
primers at least 100 bp from any of the sgRNA 
target sites (table S15). PCR products were 
cleaned-up and analyzed by Sanger sequenc- 
ing (Genewiz). Sanger data files and sgsRNA 
target sequences were input into Inference 
of CRISPR Edits (ICE) analysis (ice.synthego. 
com) to determine editing efficiency and to 
quantify generated indels (60). Percentage of 
alleles edited is expressed as an ice-d score. 
This score is a measure of how discordant the 
sanger trace is before versus after the edit. It is 
a simple and robust estimate of editing effi- 
ciency in a pool, especially suited to highly dis- 
ruptive editing techniques like multiguide. 


Identification of essential genes for siRNA and 
Cas9 knockout screen 


We used longitudinal imaging in A549 cells 
to assess cell viability (fig. S18). For bench- 
marking, relative cell viability was measured 
by CellTiter-Glo Luminescent Cell Viability As- 
say (Promega; G7571) as per manufacturer’s 
instructions. Briefly, two passages postnucleo- 
fection A549 siRNA pools cultured in 96-well 
tissue-culture treated plates (Corning, #3595) 
were lysed in the CellTIter-Glo reagent, by re- 
moving spent media and adding 100 ul of the 
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CellTiter-Glo reagent containing the CellTiter- 
Glo buffer and CellTiter-Glo Substrate. Cells 
were placed on an orbital shaker for 2 min on 
a SpectraMax 1D5 (Molecular Devices) and 
then incubated in the dark at room tempera- 
ture for 10 min. Completely lysed cells were 
pipette mixed and 25 ul were transferred to a 
384-well assay plate (Corning, #3542). The 
luminescence was recorded on a SpectraMax 
iD5 (Molecular Devices) with an integration 
time of 0.25 s per well. Luminescence readings 
were all normalized to the without-sgRNA 
control condition. 

To determine cell viability in Caco-2 knock- 
outs we used longitudinal imaging (fig. S18). 
All gene knockout pools were maintained for 
a minimum of six passages to determine the 
effect of loss of protein function on cell fitness 
before viral infection. Viability was determined 
through longitudinal imaging and automated 
image analysis using a Celigo Imaging Cytom- 
eter (Celigo). Each gene knockout pool was split 
in triplicate wells on separate plates. Every day, 
except the day of seeding, each well was scan- 
ned and analyzed using built-in Confluence 
imaging parameters using autoexposure and 
autofocus with an offset of -45 um. Analysis 
was performed with standard settings except 
for an intensity threshold setting of 8. Con- 
fluency was averaged across three wells and 
plotted over time. Viability genes were deter- 
mined as pools that, after six passages, re- 
mained <20% confluent 5 days after seeding. 
Genes deemed essential were excluded from 
the knockout screen. 


Cells, virus, and infections for Caco-2 Cas9 
knockout screen 


Wild-type and CRISPR-edited Caco-2 cells were 
grown at 37°C, 5% CO. in DMEM, 10% FBS. 
SARS-CoV-2 stocks were grown and titered on 
Vero E6 cells as described previously (67). Wild- 
type and CRISPR-edited Caco-2 cell lines were 
infected with SARS-CoV-2 at a MOI of 0.01 in 
DMEM supplemented with 2% FBS. Seventy- 
two hours postinfection, supernatants were 
harvested and stored at —80°C and the Caco-2 
wild-type (WT) and CRISPR knockout (KO) 
cells were fixed with 10% neutral buffered 
formalin (NBF) for 1 hour at room temper- 
ature to enable further analysis. 


Focus-forming assay for Caco-2 Cas9 knockout screen 


Vero E6 cells were plated into 96-well plates 
at confluence (50,000 cells per well) in DMEM 
supplemented with 10% heat-inactivated FBS 
(Gibco). Before infection, supernatants from 
infected Caco-2 WT and CRISPR KO cells were 
thawed and serially diluted from 107 to 107°. 
Growth media was removed from the Vero E6 
cells and 40 ul of each virus dilution was 
plated. After 1 hour of adsorption at 37°C, 5% 
CO., 40 ul of 2.4% microcrystalline cellulose 
(MCC) overlay supplemented with DMEM 
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powdered media (Gibco) to a concentration 
of 1x was added to each well of the 96-well 
plate to achieve a final MCC overlay concen- 
tration of 1.2%. Plates were then incubated 
at 37°C, 5% CO. for 24 hours. The MCC overlay 
was gently removed and cells were fixed with 
10% NBF for 1 hour at room temperature. After 
removal of NBF, monolayers were washed with 
ultrapure water and ice-cold 100% methanol/ 
0.3% H-O, was added for 30 min to permea- 
bilize the cells and quench endogenous perox- 
idase activity. Monolayers were then blocked 
for 1 hour in PBS with 5% nonfat dry milk 
(NFDM). After blocking, monolayers were in- 
cubated with SARS-CoV N primary antibody 
(Novus Biologicals; NB100-56576; 1:2000) for 
1 hour at room temperature in PBS, 5% NFDM. 
Monolayers were washed with PBS and in- 
cubated with an HRP-conjugated secondary 
antibody for 1 hour at room temperature in 
PBS with 5% NFDM. Secondary antibody was 
removed, monolayers were washed with PBS, 
and then developed using TrueBlue substrate 
(KPL) for 30 min. Plates were imaged on a 
Bio-Rad Chemidoc utilizing a phosphorscreen 
and foci were counted by eye to calculate focus- 
forming units per ml (FFU/ml) for each knock- 
out. The original formalin-fixed Caco-2 WT 
and CRISPR KO cells were stained with DAPI 
(Thermo Scientific) and imaged on a Cytation 
5-plate reader to determine cell viability. Wells 
containing no cells were excluded from fur- 
ther analyses. 


Quantitative analysis and scoring of knockdown 
and knockout library screens 


Virus readout by qPCR (A549-ACE2, expressed 
as plaque-forming units per milliliter) and 
focus-forming assay readouts (Caco-2, focus- 
forming units per milliliter) were processed 
using the RNAither package (www.bioconductor. 
org/packages/release/bioc/html/RNAither.html) 
in the statistical computing environment R. 
The two datasets were normalized separately, 
using the following method. The readouts 
were first log transformed (natural logarithm), 
and robust g-scores [using median and MAD 
(median absolute deviation) instead of mean 
and standard deviation] were then calculated 
for each 96-well plate separately. g-scores of 
multiple replicates of the same perturbation 
were averaged into a final z-score for presen- 
tation in Fig. 5. No filtering was done on the 
basis of differences in replicate z-scores, but all 
replicate scores are individually listed in tables 
S6 and S7. We suggest consulting the replicate 
g-scores for all genes and perturbations of in- 
terest. The A549-ACE2 siRNA screen includes 
three replicates (or more) of each perturba- 
tion, and the Caco-2 CRISPR screen includes 
two replicates (or more) of each perturbation. 
The results from the A549-ACE2 screen cover 
all 332 screened genes (331 SARS-CoV-2 in- 
teractors plus ACE2). The results from the 
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Caco-2 screen cover 286 of the screened genes 
plus ACE2. The remaining Caco-2 genes were 
either deemed essential, failed editing, or 
failed in the focus-forming assay. 


Antiviral drug and cytotoxicity assays 
(A549-ACE2 cells) 


In total, 2500 A549-ACE2 cells were seeded 
into 96- or 384-well plates in DMEM (10% 
FBS) and incubated for 24 hours at 37°C, 5% 
CO». Two hours before infection, the media 
was replaced with 120 ul (96-well format) or 
50 ul (384-well format) of DMEM (2% FBS) 
containing the compound of interest at the 
indicated concentration. At the time of infec- 
tion, the media was replaced with virus in- 
oculum (MOI 0.1 PFU per cell) and incubated 
for 1 hour at 37°C, 5% CO». After the ad- 
sorption period, the inoculum was removed, 
replaced with 120 ul (96-well format) or 50 ul 
(384-well format) of drug-containing media, 
and cells were incubated for an additional 
72 hours at 37°C, 5% CO.. At this point, the 
cell culture supernatant was harvested, and 
viral load was assessed by RT-qPCR (as de- 
scribed in the section Viral infection and 
quantification assay in A549-ACE2 cells). 
Viability was assayed using the CellTiter-Glo 
assay following the manufacturer’s protocol 
(Promega). Luminescence was measured in a 
Tecan Infinity 2000 plate reader, and percent- 
age viability calculated relative to untreated 
cells (100% viability) and cells lysed with 20% 
ethanol or 4% formalin (0% viability), included 
in each experiment. 


Antiviral drug and cytotoxicity assays 
(Vero E6 cells) 


Viral growth and cytotoxicity assays in the 
presence of inhibitors were performed as 
previously described (5). In total, 2000 Vero 
E6 cells were seeded into 96-well plates in 
DMEM (10% FBS) and incubated for 24 hours 
at 37°C, 5% CO,. Two hours before infection, 
the medium was replaced with 100 ul of 
DMEM (2% FBS) containing the compound 
of interest at concentrations 50% greater 
than those indicated, including a DMSO con- 
trol. SARS-CoV-2 virus (100 PFU; MOI 0.025) 
was added in 50 ul of DMEM (2% FBS), bring- 
ing the final compound concentration to those 
indicated. Plates were then incubated for 
48 hours at 37°C. After infection, supernatants 
were removed, and cells were fixed with 4% 
formaldehyde for 24 hours before being re- 
moved from the BSL3 facility. The cells were 
then immunostained for the viral NP protein 
(rabbit antisera produced in the Garcia-Sastre 
laboratory; 1:10,000) with a DAPI counter- 
stain. Infected cells (488 nm) and total 
cells (DAPI) were quantified using a Celigo 
(Nexcelcom) imaging cytometer. Infectivity is 
measured by the accumulation of viral NP 
protein in the nucleus of the cells (fluores- 
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cence accumulation). Percent infection was 
quantified as {[(number of infected cells / 
total cells) — background] x 100}, and the 
DMSO control was then set to 100% infec- 
tion for analysis. The IC;> and ICg9 for each 
experiment was determined using the Prism 
(GraphPad Software) software. Cytotoxicity 
measurements were performed using the 
MITT assay (Roche), according to the manu- 
facturer’s instructions. Cytotoxicity was per- 
formed in uninfected Vero E6 cells with same 
compound dilutions and concurrent with 
viral replication assay. All assays were per- 
formed in biologically independent triplicates. 
Sourcing information for all drugs tested may 
be found in table S10. 


Coimmunoprecipitation assays for ORF9b 
and Tom70 


HEK293T and A549 cells were transfected 
with the indicated mammalian expression plas- 
mids using Lipofectamine 2000 (Invitrogen) 
and TransIT-X2 (Mirus Bio), respectively. Twenty- 
four hours after transfection, cells were har- 
vested and lysed in NP-40 lysis buffer [0.5% 
Nonidet P 40 Substitute (NP-40; Fluka Ana- 
lytical), 50 mM Tris-HCl, pH 7.4 at 4°C, 150 mM 
NaCl, 1 mM EDTA] supplemented with cOm- 
plete mini EDTA-free protease and PhosSTOP 
phosphatase inhibitor cocktails (Roche). Clari- 
fied cell lysates were incubated with Streptac- 
tin Sepharose beads (IBA) for 2 hours at 4°C, 
followed by five washes with NP-40 lysis buf- 
fer. Protein complexes were eluted in the SDS 
loading buffer and were analyzed by Western 
blotting with the indicated antibodies. 


Quantification of Tom70 down-regulation in 
HeLaM cells overexpressing ORF9b 


HeLaM cells were transiently transfected with 
plasmids encoding GFP-Strep, SARS-CoV-1 
ORF9b-Strep, or SARS-CoV-2 ORF9b-Strep. 
The next day, the cells were fixed using 4% 
paraformaldehyde and immunostained with 
antibodies against Strep tag, and Tom20 or 
Tom70. Representative images for each con- 
struct were captured by acquiring a single 
optical section using a Nikon A1 confocal fitted 
with a CFI Plan Apochromat VC 60x oil ob- 
jective (NA 1.4). For image quantification mul- 
tiple fields of view were captured for each 
construct using a CFI Super Plan Fluor ELWD 
40x objective (NA 0.6). The mean fluorescent 
intensity for Tom20 and Tom70 was mea- 
sured by manually drawing a region of in- 
terest around each cell using ImageJ. Between 
30 and 60 cells were quantified for each 
construct. 


Quantification of Tom70 down-regulation in 
infected Caco-2 cells 


Caco-2 cells were seeded on glass coverslips 
in triplicate and infected with SARS-CoV-2 
at a MOI of 0.1 as described above. At 24 hours 
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postinfection, cells were fixed with 4% para- 
formaldehyde and immunostained with anti- 
bodies against Tom70, Tom20, and ORF9b. 
For signal quantification images of nonin- 
fected and neighboring infected cells were 
acquired using a LSM800 confocal laser- 
scanning microscope (Zeiss) equipped with a 
63X, 1.4 NA oil objective and the Zen blue 
software (Zeiss). The mean fluorescence inten- 
sity of each cell was measured by ImageJ 
software. Forty-three cells were quantified for 
each condition—infected or noninfected—from 
three independent experiments. 


Coexpression and purification of ORF9b-Tom70 
(residues 109 to end) complexes 


SARS-CoV-2 ORF9b and Tom70 (residues 109 
to end) were coexpressed using a pET29-b(+) 
vector backbone where ORF9b was tag-less 
and Tom’70 had an N-terminal 10XHis-tag and 
SUMO-tag. LOBSTR E. coli cells transformed 
with the above construct were grown at 37°C 
until they reached an optical density at 600 nm 
(OD¢o0) of 0.8, then expression was induced 
at 37°C with 1 mM IPTG for 4 hours. Frozen 
cell pellets were resuspended in 25 ml of lysis 
buffer (200 mM NaCl, 50 mM Tris-HCl pH 8.0, 
10% v/v glycerol, 2 mM MgCl.) per liter of cell 
culture, supplemented with cOmplete protease 
inhibitor tablets (Roche), 1 mM phenylmethyl- 
sulfonyl fluoride (PMSF) (Sigma), 100 ug/ml 
lysozyme (Sigma), 5 ug/ml DNasel (Sigma), 
and then homogenized with an immersion 
blender (Cuisinart). Cells were lysed by 3x pas- 
sage through an Emulsiflex C3 cell disrup- 
tor (Avestin) at ~103,000 kPa, and the lysate 
clarified by ultracentrifugation at 100,000 x g 
for 30 min at 4°C. The supernatant was col- 
lected, supplemented with 20 mM imidazole, 
loaded into a gravity flow column containing 
Ni-NTA superflow resin (Qiagen), and rocked 
with the resin at 4°C for 1 hour. After allow- 
ing the column to drain, resin was rinsed 
twice with 5 column volumes (cv) of wash 
buffer [150 mM KCl, 30 mM Tris-HCl pH 8.0, 
10% v/v glycerol, 20 mM imidazole, 0.5 mM 
tris(hydroxypropyl)phosphine (THP, VWR)] 
supplemented with 2 mM ATP (Sigma) and 
4 mM MgCl,, then washed with 5 cv wash 
buffer with 40 mM imidazole. Resin was then 
rinsed with 5 cv Buffer A (50 mM KCl, 30 mM 
Tris-HCl pH 8.0, 5% glycerol, 0.5 mM THP) 
and protein was eluted with 2 x 2.5 cv Buffer A 
plus 300 mM imidazole. Elution fractions were 
combined, supplemented with Ulp1 protease, 
and rocked at 4°C for 2 hours. Ulp1-digested 
Ni-NTA eluate was diluted 1:1 with additional 
Buffer A, loaded into a 50 ml Superloop, and 
applied to a MonoQ 10/100 column on an Akta 
pure system (GE Healthcare) using 100% Buf- 
fer A, 0% Buffer B (1000 mM KCl, 30 mM Tris- 
HCl pH 8.0, 5% glycerol, 0.56 mM THP). The 
MonoQ column was washed with 0 to 40% 
Buffer B gradient over 15 cv, peak fractions 
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were analyzed by SDS-PAGE and the identity 
of tagless Tom’70 (109 to end) and ORF9b pro- 
teins confirmed by intact protein MS (Xevo 
G2-XS Mass Spectrometer, Waters). Peak frac- 
tions eluting at ~15% B contained relatively 
pure Tom70 (109 to end) and ORF9b, and these 
were concentrated using 10 kDa Amicon cen- 
trifugal filter (Millipore) and further purified 
by size exclusion chromatography using a Sup- 
erdex 200 increase 10/300 GL column (GE 
Healthcare) in buffer containing 150 mM KCl, 
20 mM HEPES-NaOH pH 7.5, 0.5 mM THP. 
The sole size-exclusion peak contained both 
Tom70 (109 to end) and ORF9b, and the cen- 
ter fraction was used directly for cryo-EM grid 
preparation. 


Expression and purification of 
SARS-CoV-2 ORF9b 


ORF9b with N-terminal 10XHis-tag and SUMO- 
tag was expressed using a pET-29b(+) vector 
backbone. LOBSTR £. coli cells transformed 
with the above construct were grown at 37°C 
until they reached an optical density at 600 nm 
(OD¢600) of 0.8, then expression was induced at 
37°C with 1 mM IPTG for 6 hours. Frozen cell 
pellets were lysed, homogenized, clarified, and 
subject to Ni affinity purification as described 
above for ORF9b-Tom70 complexes, with sev- 
eral small changes. Lysis buffers and Ni-NTA 
wash buffers contained 500 mM NaCl, and an 
additional wash step using 10 cv wash buffer 
plus 0.2% TWEEN20 plus 500 mM NaCl was 
carried out before the ATP wash. ORF9b was 
eluted from Ni-NTA resin in Buffer A (50 mM 
NaCl, 25 mM Tris pH 8.5, 5% glycerol, 0.5 mM 
THP) supplemented with 300 mM imidazole. 
This eluate was diluted 1:1 with additional 
Buffer A, loaded into a 50 ml Superloop, and 
applied to a MonoQ 10/100 column on an Akta 
pure system (GE Healthcare) using 100% 
Buffer A, 0% Buffer B (1000 mM NaCl, 25mM 
Tris-HCl pH 8.5, 5% glycerol, 0.5 mM THP). 
The MonoQ column was washed with O to 
40% Buffer B gradient over 15 cv, and rela- 
tively pure ORF9b eluted at 20 to 25% Buffer 
B, whereas ORF9b and contaminating pro- 
teins eluted at 30 to 35% buffer B. Fractions 
from these two peaks were combined and in- 
cubated with Ulpl and HRV3C proteases at 
4°C for 2 hours, supplemented with 10 mM 
imidazole, then thrice flowed back through 
1 ml of Ni-NTA resin equilibrated with size- 
exclusion buffer (as above) plusl10 mM imid- 
azole. The reverse-Ni purified sample was 
concentrated using 10 kDa Amicon centrifugal 
filter and then further purified by size exclu- 
sion chromatography using a Superdex 200 
increase 10/300 GL column. 


Expression and purification of Tom70 (109-end) 


Tom’70 (109 to end) with N-terminal 10XHis- 
tag and SUMO-tag and C terminus Spy-tag, 
HRV-3C protease cleavage site, and eGFP-tag 
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was expressed using a pET-21(+) vector back- 
bone. LOBSTR E. coli cells transformed with 
the above construct were grown at 37°C until 
they reached an optical density at 600 nm 
(ODgo0) of 0.8, then expression was induced 
at 16°C with 0.5 mM IPTG overnight. The 
soluble domain of Tom70 [Tom70 (109-end)] 
was purified as described in (62) with some 
modifications. Frozen cell pellets of LOBSTR 
E. coli transformed with the above construct 
were resuspended in 50 ml lysis buffer (500 mM 
NaCl, 20 mM KH.PO, pH 7.5) per liter cell cul- 
ture, supplemented with 1 mM PMSF (Sigma) 
and 100 ug/ml, and homogenized. Cells were 
lysed by 3x passage through an Emulsiflex C3 
cell disruptor (Avestin) at ~103,000 kPa, and 
the lysate clarified by ultracentrifugation at 
100,000 x g for 30 min at 4°C. The superna- 
tant was collected, supplemented with 20 mM 
imidazole, loaded into a gravity flow column 
containing Ni-NTA superflow resin (Qiagen), 
and rocked with the resin at 4°C for 1 hour. 
After allowing the column to drain, resin was 
rinsed twice with 5 column volumes (cv) of 
wash buffer (500 mM KCI, 20 mM KH PO, 
pH 8.0, 20 mM imidazole, 0.5 mM THP) sup- 
plemented with 2 mM ATP (Sigma) and 4 mM 
MgCl, then washed with 5 cv wash buffer with 
40 mM imidazole. Bound Tom’70 (109 to end) 
was then cleaved from the resin by 2-hour in- 
cubation with Ulp1 protease in 4 cv elution 
buffer 150 mM KCl, 20 mM KH.PO, pH 8.0, 
5 mM imidazole, 0.5 mM THP). After cleav- 
age with Ulpl, the flow through was collected 
along with a 2-cv rinse of the resin with ad- 
ditional elution buffer. These fractions were 
combined and HRV3C protease was added 
to remove the C-terminal EGFP tag (1:20 
HRV3C to Tom70). After 2-hour HRV3C di- 
gestion at 4°C, the double-digested Tom’70 
(109 to end) was concentrated using a 30 kDa 
Amicon centrifugal filter (Millipore) and further 
purified by size exclusion chromatography 
using a Superdex 200 increase 10/300 GL 
column (GE Healthcare) in buffer containing 
150 mM KCl, 20 mM HEPES-NaOH pH 7.5, 
0.5 mM THP. 


Prediction of SARS-CoV-2 ORF9b internal 
mitochondrial targeting sequence 


ORF9b was analyzed for the presence of an 
internal mitochondrial targeting sequence 
Gi-MTS) as described in (63) using the TargetP- 
2.0 server (64). Sequences corresponding to 
ORF9b N-terminal truncations of 0 to 62 resi- 
dues were submitted to the TargetP-2.0 server, 
and the probability of the peptides containing 
an MTS plotted against the numbers of resi- 
dues truncated. A similar analysis using the 
MitoFates server (65) predicted that ORF9b 
residues 54 to 63 were the most likely to make 
up a presequence MTS on the basis of their 
propensity to form a positively charged amphi- 
pathic helix. Notably this analysis was con- 
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sistent with the secondary structure prediction 
from JPRED (66). 


Cryo-EM sample preparation and data collection 


Three ul of ORF9b-Tom70 complex (12.5 uM) was 
added to a 400 mesh 1.2/1.3R Au Quantifoil 
grid previously glow discharged at 15 mA for 
30 s. Blotting was performed with a blot force 
of O for 5 s at 4°C and 100% humidity in a FEI 
Vitrobot Mark IV (ThermoFisher) before plunge 
freezing into liquid ethane. A total of 1534 
118-frame super-resolution movies were collected 
with a3 by 3 image-shift collection strategy at 
a nominal magnification of 105,000x (physical 
pixel size: 0.834 A per pixel) on a Titan Krios 
(ThermoFisher) equipped with a K3 camera and 
a Bioquantum energy filter (Gatan) set to a slit 
width of 20 eV. Collection dose rate was 8 elec- 
trons per pixel per second for a total dose of 66 
electrons per square angstrom. Defocus range was 
—0.7 to —2.4 um. Each collection was performed 
with semiautomated scripts in SerialEM (67). 


Cryo-EM image processing and model building 


We motion corrected 1534 movies using 
Motioncor2 (68), and we imported dose-weighted 
summed micrographs in cryosparc (version 
2.15.0). Then, 1427 micrographs were curated 
on the basis of contrast transfer function 
(CTF) fit (better than 5 A) from a patch CTF 
job. Template-based particle picking resulted 
in 2,805,121 particles, and 1,616,691 particles 
were selected after 2D classification. Five 
rounds of 3D classification using multiclass 
ab initio reconstruction and heterogeneous 
refinement yielded 178,373 particles. Homo- 
geneous refinement of these final particles led 
to a 3.1-A electron density map which was used 
for model building. The reconstruction was fil- 
tered by the masked Fourier shell correlation 
(FSC) and sharpened with a b-factor of -145. 

To build the model of Tom’70 (109 to end), 
the crystal structure of Saccharomyces cerevisiae 
Tom71 (PDB ID: 3fp3; sequence identity 25.7%) 
was first fit into the cryo-EM density as a rigid 
body in UCSF ChimeraX and then relaxed into 
the final density using Rosetta FastRelax mover 
in torsion space. This model, along with a BLAST 
alignment of the two sequences (69), was used as 
a starting point for manual building using COOT 
(70). After initial building by hand, the regions 
with poor density fit or geometry were itera- 
tively rebuilt using Rosetta (77). ORF9b was 
built de novo into the final density using COOT, 
informed and facilitated by the predictions of 
the TargetP-2.0, MitoFates, and JPRED servers. 
The ORF9b-Tom70 complex model was sub- 
mitted to the Namdinator web server (72) and 
further refined in ISOLDE 1.0 (73) using the 
plugin for UCSF ChimeraX (74). Final model 
b-factors were estimated using Rosetta. The model 
was validated using phenix.validation_cryoem 
(75). The final model contains residues 109 to 
272 and 298 to 600 of human Tom70 and 39 


20 of 25 


RESEARCH | RESEARCH ARTICLE 


to 76 of SARS-CoV-2 ORF9b. Molecular inter- 
face between ORF9b and Tom70 was analyzed 
using the PISA web server (76). Figures were 
prepared using UCSF ChimeraX. 


Computational human genetics analysis 


To look for genetic variants associated with our 
list of proteins that had a meaningful impact 
on SARS-CoV-2 replication, we used the largest 
proteomic GWAS study to date (28). We identi- 
fied ILI7RA as one of the proteins assayed in 
Sun et al.’s proteomic GWAS and observed that 
it had multiple cis-acting protein quantitative 
trait loci (pQTLs) at a corrected P value of 1 x 
10°”, where cis-acting is defined as within 1 Mb 
of the transcription start site of ILI7RA. 

We used the GSMR method (29) to perform 
Mendelian randomization (MR) using near- 
independent [linkage disequilibrium (LD) R? = 
0.05, where R? is the coefficient of determi- 
nation] cis-pQTLs for JL17RA. The advantage 
of the GSMR method over conventional MR 
methods is twofold. First, GSMR performs MR 
adjusting for any residual correlation between 
selected genetic variants by default. Second, 
GSMR has a built-in method called HEIDI 
(heterogeneity in dependent instruments)- 
outlier that performs heterogeneity tests in 
the near-independent genetic instruments and 
removes potentially pleiotropic instruments 
(i.e., where there is evidence of heterogeneity 
at P < 0.01). Details of the GSMR and HEIDI 
method have been published previously (29). 

Summary statistics generated by COVID- 
HGI (round 3; www.covid19hg.org/results/) 
for COVID-19 versus population, hospitalized 
COVID-19 versus population and hospitalized 
COVID-19 versus nonhospitalized COVID-19 
were used for JLI7RA MR analysis. We used 
the 1000 genomes phase 3 European popula- 
tion genotype data to derive the LD correlation 
matrix for this analysis. The phenotype defini- 
tions as provided by COVID-HGI are as follows. 
COVID-19 versus population: Case, individuals 
with laboratory confirmation of SARS-CoV-2 
infection, EHR/ICD coding/Physician-confirmed 
COVID-19, or self-reported COVID-19 positive; 
control, everybody that is not a case. Hospi- 
talized COVID-19 versus population: case, hos- 
pitalized, laboratory confirmed SARS-CoV-2 
infection or hospitalization due to COVID- 
19-related symptoms; control, everybody that 
is not a case, e.g., population. Hospitalized 
COVID-19 versus nonhospitalized COVID-19: 
case, hospitalized, laboratory confirmed SARS- 
CoV-2 infection or hospitalization due to 
COVID-19-related symptoms; control, labora- 
tory confirmed SARS-CoV-2 infection and not 
hospitalized 21 days after the test. 


Infections and treatments for IL-17A 
treatment studies 


The WA-1 strain (BEI resources) of SARS-CoV-2 
was used for all experiments. All live virus 
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experiments were performed in a BSL3 labo- 
ratory. SARS-CoV-2 stocks were passaged in 
Vero E6 cells (ATCC) and titer was determined 
via plaque assay on Vero E6 cells as previously 
described (77). Briefly, virus was diluted 1:10 
to 1:10° and incubated for 1 hour on Vero E6 
cells before an overlay of Avicel and complete 
DMEM (Sigma Aldrich, SLM-241) was added. 
After incubation at 37°C for 72 hours, the 
overlay was removed and cells were fixed with 
10% formalin, stained with crystal violet, and 
counted for plaque formation. SARS-CoV-2 in- 
fections of A549-ACE2 cells were done at a 
MOI of 0.05 for 24 hours. Inhibitors and cyto- 
kines were added concurrently with virus. All 
infections were done in technical triplicate. Cells 
were treated with the following compounds: 
Remdesivir (SELLECK CHEMICALS LLC, $8932) 
and IL-17A (Millipore-Sigma, SRP0675). 


RNA extraction, RT, and RT-qPCR for IL-17A 
treatment studies 


Total RNA from samples was extracted using 
the Direct-zol RNA kit (Zymogen, R2060) 
and quantified using the NanoDrop 2000c 
(ThermoFisher). cDNA was generated using 
500 ng of RNA from infected A549-ACE2 cells 
with Superscript III reverse transcription 
(ThermoFisher, 18080-044:) and oligo(dT),9-18 
(ThermoFisher, 18418-012) and random hexa- 
mer primers (ThermoFisher, $0142). RT-qPCR 
reactions were performed on a CFX384 (BioRad) 
and delta cycle threshold (ACt) was deter- 
mined relative to RPLI3A levels. Viral detection 
levels and target host genes in treated samples 
were normalized to water-treated controls. The 
SYBR green qPCR reactions contained 5 ul of 2x 
Maxima SYBR green/Rox qPCR Master Mix 
(ThermoFisher; K0221), 2 ul of diluted cDNA, 
and 1 nmol of both forward and reverse pri- 
mers, in a total volume of 10 ul. The reactions 
were run as follows: 50°C for 2 min and 95°C 
for 10 min, followed by 40 cycles of 95°C for 5 s 
and 62°C for 30 s. Primer efficiencies were 
~100%. Dissociation curve analysis after the 
end of the PCR confirmed the presence of a 
single and specific product. RT-qPCR primers 
were used against the SARS-CoV-2 E gene 
(PF_042_nCoV_E_F: ACAGGTACGTTAATAGT- 
TAATAGCGT; PF_042_ nCoV_E_R: ATATTG- 
CAGCAGTACGCACACA), the CXCL8 gene 
(CXCL8 For: ACTGAGAGTGATTGAGAGTG- 
GAC; CXCL8 Rev: AACCCTCTGCACCCAGTT- 
TTC), and the RPLI3A gene (RPLI3A For: 
CCTGGAGGAGAAGAGGAAAGAGA; RPLI13A 
Rev: TTGAGGACCTCTGTGTATTTGTCAA). 


Transfections for IL-17A treatment studies 


HEK293T cells were seeded 5 x 10°cells per 
well (in 6-well plate) or 3 x 10° cells per 10-cm” 
plates. The next day, 2 or 10 ug of plasmids was 
transfected using X-tremeGENE 9 DNA Trans- 
fection Reagent (Roche) in 6-well plate or 
10-cm” plates, respectively. For IL-17A (Millipore- 
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Sigma, SRP0675) incubation in cells, 0.5 ug of 
IL-17A was treated either before or after trans- 
fection and incubated at 37°C. After 48 hours, 
cells were collected by trypsinization. For 
IL-17A incubation with cell lysates, transfected 
cell lysates were incubated in the presence 
of 0.5 or 5 ug/ml IL-17A at 4°C on a rotator 
overnight. Plasmids pLVX-EFlalpha-SARS-CoV- 
2-orf8-2xStrep-IRES-Puro (ORF8) and pLVX- 
EFlalpha-eGFP-2xStrep-IRES-Puro (EGFP-Strep) 
were a gift from N. J. Krogan. (Addgene plasmid 
#141390, 141395) (5). pLVX-EF lalpha- IRES-Puro 
(Vector) was obtained from Takara/Clontech. 


SARS-CoV-2 ORF8 and IL17RA 
coimmunoprecipitation 


Transfected and treated HEK293T cells were 
pelleted and washed in cold D-PBS and later 
resuspended in Flag-IP Buffer (50 mM Tris 
HCl, pH 7.4, with 150 mM NaCl, 1 mM EDTA, 
and 1% NP-40) with Ix HALT (ThermoFisher 
Scientific, 78429), incubated with buffer for 
15 min on ice then centrifuged at 13,000 rpm 
for 5 min. The supernatant was collected and 
1 mg of protein was used for immunoprecipi- 
tation (IP) with 100 ul of Streptactin Sepharose 
(IBA, 2-1201-010) on a rotor overnight at 4°C. 
Immunoprecipitates were washed five times 
with Flag-IP buffer and eluted with 1x Buffer E 
(100 mM Tris-Cl, 150 mM NaCl, 1 mM EDTA, 
2.5 mM Desthiobiotin). Eluate was diluted with 
Ix-NuPAGE (ThermoFisher Scientific, #+NPOOO8) 
LDS Sample Buffer with 2.5% B-Mercaptoethanol 
and blotted for targeted antibodies. Antibodies 
used were Strep tag IT (Qiagen, #34850), B-Actin 
(Sigma, #A5316), and ILI7RA (Cell Signaling, 
#12661S). 


Computational docking of PGES-2 and Nsp7 


A model for human PGES-2 dimer was con- 
structed by homology using MODELER (78) 
from the crystal structure of Macaca fascularis 
mPGES-2 [PDB ID: 1Z9H (79); 98% sequence 
identity] bound to indomethacin. Indomethacin 
was removed from the structure utilized for 
docking. The structure of SARS-CoV-2 Nsp7 
was extracted from PDB ID 7BV2 (80). Dock- 
ing models were produced using ClusPro 
(81), ZDock (82), HDock (83), Gramm-X (84), 
SwarmDock (85), and PatchDock (86) with 
SOAP-PP score (87). For each protocol, up to 
100 top scoring models were extracted (fewer 
for those that do not report >100 models); for 
PatchDock, models with SOAP-PP g-scores 
>3.0 were used (fig. S23A). The 420 models 
were clustered at 4.0-A RMSD, resulting in 
127 clusters. The two largest clusters, composed. 
of 192 models, are related by dimer symmetry. 
All other clusters contain <15 models. 


Assessment of positive selection signatures 
in SIGMARI 


SIGMARI] protein alignments were generated 
from whole-genome sequences of 359 mammals 
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curated by the Zoonomia consortium. Pro- 
tein alignments were generated with TOGA 
(https://github.com/hillerlab/TOGA), and miss- 
ing sequence gaps were refined with CACTUS 
(88, 89). Branches undergoing positive selec- 
tion were detected with the branch-site test 
aBSREL (90) implemented in the HyPhy pack- 
age (90, 91). PhyloP was used to detect codons 
undergoing accelerated evolution along branches 
detected as undergoing positive selection by 
aBSREL relative to the neutral evolution rate 
in mammals, determined using phyloFit on 
third nucleotide positions of codons which 
are assumed to evolve neutrally. P values from 
phyloP were corrected for multiple tests using 
the Benjamini-Hochberg method (92). PhyloFit 
and phyloP are both part of the PHAST pack- 
age version 1.4 (93, 94). 


Comparative SARS-CoV-1 inhibition 
by amiodarone 


SARS-CoV-1 (Urbani) drug screens were performed 
with Vero E6 cells (ATCC #1568, Manassas, 
VA) cultured in DMEM (Quality Biological), 
supplemented with 10% (v/v) heat-inactivated 
FBS (Sigma), 1% (v/v) penicillin-streptomycin 
(Gemini Bio-products), and 1% (v/v) L-glutamine 
(2 mM final concentration, Gibco). Cells were 
plated in opaque 96-well plates 1 day before 
infection. Drugs were diluted from stock to 
50 uM and an 8-point 1:2 dilution series pre- 
pared in duplicate in Vero Media. Every com- 
pound dilution and control were normalized 
to contain the same concentration of drug ve- 
hicle (e.g., DMSO). Cells were pretreated with 
drug for 2 hours at 37°C (5% CO2) before in- 
fection with SARS-CoV-1 at MOI 0.01. In addi- 
tion to plates that were infected, parallel plates 
were left uninfected to monitor cytotoxicity 
of drug alone. All plates were incubated at 
37°C (5% CO2) for 3 days before performing 
CellTiter-Glo (CTG) assays as per the manu- 
facturer’s instruction (Promega, Madison, WI). 
Luminescence was read on a BioTek Synergy 
HTX plate reader (BioTek Instruments Inc., 
Winooski, VT) using the Gen5d software (version 
7.07, Biotek Instruments Inc., Winooski, VT). 


Real-world data source and analysis 


This study used deidentified patient-level re- 
cords from HealthVerity’s Marketplace dataset, 
a nationally representative dataset covering 
>300 million patients with medical and phar- 
macy records from >60 health care data sources 
in the United States. The current study used 
data from 738,933 patients with documented 
COVID-19 infection between 1 March 2020 
and 17 August 2020, defined as a positive or 
presumptive positive viral laboratory test result 
or an International Classification of Diseases, 
10th Revision, Clinical Modification (ICD-10- 
CM) diagnosis code of U07.1 (COVID-19). 

For this population, we analyzed medical 
claims, pharmacy claims, laboratory data, and 
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hospital chargemaster data containing diag- 
noses, procedures, medications, and COVID- 
19 laboratory results from both inpatient and 
outpatient settings. Claims data included open 
(unadjudicated) claims sourced in near-real 
time from practice management and billing 
systems, claims clearinghouses, and labora- 
tory chains, as well as closed (adjudicated) 
claims encompassing all major U.S. payer types 
(commercial, Medicare, and Medicaid). For 
inpatient treatment evaluations, we used linked 
hospital chargemaster data containing records 
of all billable procedures, medical services, and 
treatments administered in hospital settings. 
Linkage of patient-level records across these 
data types provides a longitudinal view of base- 
line health status, medication use, and COVID-19 
progression for each patient under study. Data 
for this study covered the period of 1 December 
2018 through 17 August 2020. All analyses 
were conducted with the Aetion Evidence Plat- 
form version r4.6. 

This study was approved by the New England 
institutional review board (IRB) (no. 1-9757-1). 
Medical records constitute protected health in- 
formation and can be made available to qua- 
lified individuals on reasonable request. 


Observation of hospitalization outcomes 
in outpatient new users of indomethacin 
(treatment arm) versus celecoxib (active 
comparator) using real-world data 


We used an incident (new) user, active com- 
parator design (95, 96) to assess the risk of hos- 
pitalization among newly diagnosed COVID-19 
patients who were subsequently treated with 
indomethacin or the comparator agent, cele- 
coxib. Patients were required to have COVID-19 
infection recorded in an outpatient setting 
during the study period of 1 March 2020 to 
17 August 2020 and occurring in the 21 days 
before (and including) the date of indomethacin 
or celecoxib treatment initiation. Prevalent 
users of prescription-only NSAIDs (any prescript- 
ion fill for indomethacin, celecoxib, ketoprofen, 
meloxicam, sulindac, or piroxicam 60 days 
prior) and patients hospitalized in the 21 days 
before and including the date of treatment 
initiation were excluded from this analysis. 
Using RSS, patients treated with indometh- 
acin were matched at a 1:1 ratio to controls 
randomly selected among patients treated with 
celecoxib, with direct matching on calendar 
date of treatment (+7 days), age (+5 years), sex, 
Charlson comorbidity index (exact) (97), time 
since confirmed COVID-19 (+5 days), and dis- 
ease severity based on the highest-intensity 
COVID-19-related health service in the 7 days 
before and including the date of treatment 
initiation (laboratory service only versus out- 
patient medical visit versus emergency depart- 
ment visit) and symptom profile in the 21 days 
before and including the date of treatment 
initiation (recorded symptoms versus none). 
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This risk-set-sampled population was further 
matched on a PS (33) estimated using logistic 
regression with 24 demographic and clinical 
risk factors, including covariates related to 
baseline medical history and COVID-19 sever- 
ity in the 21 days before treatment (table S11). 
Balance between indomethacin and celecoxib 
treatment groups was evaluated by compari- 
son of absolute standardized differences in 
covariates, with an absolute standardized dif- 
ference of <0.2 indicating good balance be- 
tween the treatment groups (98). 

The primary analysis was an intention-to- 
treat design, with follow-up beginning 1 day 
after indomethacin or celecoxib initiation and 
ending on the earliest occurrence of 30 days of 
follow-up reached or end of patient data. ORs 
for the primary outcome of all-cause inpatient 
hospitalization were estimated for the RSS- 
plus-PS-matched population as well as for the 
RSS-matched population. Our primary out- 
come definition required a record of inpatient 
hospital admission with a resulting inpatient 
stay; aS a Sensitivity, a broader outcome de- 
finition captured any hospital visit (defined 
with revenue and place of service codes). 


Observation of mechanical ventilation outcomes 
in inpatient new users of typical antipsychotics 
(treatment arm) versus atypical antipsychotics 
(active comparator) using real-world data 


We used an incident user, active comparator 
design (95, 96) to assess the risk of mechani- 
cal ventilation among hospitalized COVID-19 
patients treated with typical or atypical anti- 
psychotics in an inpatient setting. See table S11 
for a list of drugs included in each category. To 
permit assessment of day-level in-hospital con- 
founders and outcomes, this analysis was re- 
stricted to hospitalized patients observable in 
hospital chargemaster data. Prevalent users of 
typical or atypical antipsychotics (any prescript- 
ion fill or chargemaster-documented use in 
60 days prior) and patients with evidence of 
mechanical ventilation in the 21 days before 
and including the date of treatment initia- 
tion were excluded from this analysis. 
Using RSS, hospitalized patients treated 
with typical antipsychotics were matched at a 
1:1 ratio to controls randomly selected among 
patients treated with atypical antipsychotics, 
with direct matching (1:1 fixed ratio) on calen- 
dar date of treatment (+7 days), age (+5 years), 
sex, Charlson comorbidity index (exact) (97), 
time since hospital admission, and disease 
severity as defined with a simplified version 
of the World Health Organization’s ordinal 
scale for clinical improvement (99). This risk- 
set-sampled population was further matched 
on a PS estimated using logistic regression 
with 36 demographic and clinical risk factors, 
including covariates related to baseline med- 
ical history, admitting status, and disease sev- 
erity at treatment (table S11). Balance between 
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typical and atypical treatment groups was 
evaluated by comparison of absolute stand- 
ardized differences in covariates, with an ab- 
solute standardized difference of <0.2 indicating 
good balance between the treatment groups (98). 

The primary analysis was an intention-to- 
treat design, with follow up beginning 1 day 
after the date of typical or atypical antipsy- 
chotic treatment initiation and ending on the 
earliest occurrence of 30 days of follow-up 
reached, discharge from hospital, or end of 
patient data. ORs for the primary outcome of 
inpatient mechanical ventilation were estimated 
for the RSS-plus-PS-matched population as well 
as for the RSS-matched population. 
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Enteroviral 3C protease activates the human NLRP1 
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INTRODUCTION: The innate immune system 
uses a variety of sensor proteins to detect in- 
fections and injuries. The nucleotide-binding oligo- 
merization domain, leucine-rich repeat, and 
pyrin domains-containing proteins (NLRPs) are a 
family of cytosolic sensors involved in the initia- 
tion of the host innate immune response. Upon 
activation, NLRPs form a multiprotein immune 
effector complex known as the inflammasome. 
Assembly of the inflammasome leads to caspase-1 
activation, the secretion of proinflammatory cyto- 
kines such as interleukin-18 (IL-18) and IL-18, and 
gasdermin D-dependent pyroptotic cell death. 
These processes orchestrate the downstream 
immune response to microbial infections and 
injuries. Genetic data support an important role 
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for human NLRPSs in antimicrobial defense. Dys- 
regulated NLRP-driven inflammasome signaling 
also contributes to the pathogenesis of several 
autoimmune and autoinflammatory diseases. 


RATIONALE: Understanding the molecular basis 
by which individual human NLRPs respond to 
specific stimuli is key to the study of innate im- 
munity and inflammatory diseases. NLRP1 was 
one of the first human NLRPs to be described. 
Yet the pathogen-derived triggers for human 
NLREP1 have not been identified, and its mech- 
anism of activation is much less understood than 
that of other NLRPs such as NLRP3. Although 
compelling evidence shows that rodent NLRP1 
can be activated by specific bacterial toxins and 
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Proposed mechanism of NLRP1 activation by 3Cpro in HRV-infected human airway epithelium. TLR, 
Toll-like receptor; RLR, retinoic acid-inducible gene l-like receptors; Pyd, pyrin domain; LRR, leucine-rich 
repeat; FIIND, function to find domain; CARD, caspase recruitment domain; ASC, apoptosis-associated 
speck-like protein containing a CARD A, Ala; C, Cys; E, Glu; G, Gly; Q, Gln; R, Arg; S, Ser; T, Thr. 
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protozoan infections, human NLRPI is not 
triggered by these stimuli. Germline-activating 
NLRP1 mutations in humans cause a variety of 
inherited diseases characterized by sustained 
inflammation and hyperplasia in the skin and 
the airway. Thus, we hypothesized that some 
unknown molecule(s) from common skin or 
airway pathogens activate human NLRPI. 


RESULTS: Here, we report that human NLRP1 
senses and is activated by enteroviral infections. 
Enteroviruses cause a range of human diseases, 
including hand-foot-and-mouth disease, polio, 
and the common cold. In our model system, we 
show that NLRP1 is highly expressed in airway 
epithelia and is activated by the human rhino- 
virus (HRV), a type of enterovirus that causes the 
common cold. During infection, a HRV compo- 
nent known as 3C protease (HRV-3Cpro) cleaves 
NLRPI1 specifically between amino acids Q130 
and G131. This cleavage causes the autoinhibitory 
N-terminal fragment (amino acids 131 to 1212) to 
become recognized by the N-terminal glycine 
degron pathway and subsequently cleared by 
the proteasome. This sequence of events culmi- 
nates in inflammasome activation mediated by 
the liberated NLRP1 C-terminal fragment (amino 
acids 1213 to 14:74), resulting in caspase-1 activa- 
tion, inflammatory cell death, and the secretion 
of inflammatory cytokines, such as IL-18, from 
infected airway epithelial cells. We also show 
that pharmacologic inhibition of the N-terminal 
glycine degron pathway by the NEDD8-cullin 
inhibitor MLN4924: effectively blocks the activa- 
tion of NLRP1 inflammasome in HRV-infected 
primary human bronchial epithelial cells. 


CONCLUSION: We establish that enteroviral 
3Cpros, which have long been regarded as a key 
mediator of viral immune evasion, in fact act as a 
trigger for the NLRP1 inflammasome in primary 
human airway epithelial cells. These results provide 
a unified mechanism for proteolysis-mediated 
activation of NLRP1 in humans and rodents. Our 
results also reveal a role for the recently described 
N-terminal glycine degron pathway in human in- 
nate immunity. Thus, NLRP1 is a versatile sensor 
protein, which has evolved to respond to bacte- 
rial, viral, and protozoan infections in different 
species. As both HRV infection and inflammasome- 
dependent IL-1 cytokines are associated with acute 
exacerbations of asthma and chronic obstructive 
pulmonary disease, the NLRP1 inflammasome 
pathway holds potential as a therapeutic tar- 
get in the treatment of inflammatory diseases 
involving the airway epithelia. 
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Immune sensor proteins are critical to the function of the human innate immune system. The full 
repertoire of cognate triggers for human immune sensors is not fully understood. Here, we report 
that human NACHT, LRR, and PYD domains-containing protein 1 (NLRP1) is activated by 3C proteases 
(3Cpros) of enteroviruses, such as human rhinovirus (HRV). 3Cpros directly cleave human NLRP1 at a 
single site between Glut? and Gly’*". This cleavage triggers N-glycine—mediated degradation of the 


autoinhibitory NLRP1 N-terminal fragment via the cullin 


ZERI/ZYGHB complex, which liberates the activating 


C-terminal fragment. Infection of primary human airway epithelial cells by live human HRV triggers 
NLRP1-dependent inflammasome activation and interleukin-18 secretion. Our findings establish 3Cpros 
as a pathogen-derived trigger for the human NLRP1 inflammasome and suggest that NLRP1 may 


contribute to inflammatory diseases of the airway. 


he human innate immune system uses 

a multitude of sensor proteins to detect 

microbial infections and kick-start the 

immune response (J). Nod-like receptor 

(NLR) proteins are a family of innate im- 
mune sensors that can detect pathogen- and 
danger-associated molecular patterns (1-3). 
Upon activation, NLR proteins nucleate the 
assembly of inflammasome complexes, lead- 
ing to pyroptotic cell death and secretion of 
processed inflammatory cytokines, such as 
interleukin-18 (IL-18) and IL-18 (4). Among 
human NLR sensors, NLRP1 remains one of 
the few whose cognate trigger has not been 
identified. Human NLRP!I1 differs from rodent 
homologs in terms of domain organization, 
ligand specificity, and tissue distribution (5), 
and its exact role in human antimicrobial re- 
sponse is still unclear. However, NLRP1 does 
play an important role in inflammatory disor- 
ders. Germline-activating mutations in NLRPI 
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cause a number of Mendelian syndromes char- 
acterized by multiple self-healing keratoacan- 
thomas of the skin and hyperkeratosis in the 
laryngeal and corneal epithelia (6-8). Carriers 
of certain common NZIRAPI single-nucleotide 
polymorphisms experience increased risks for 
autoimmune diseases such as asthma and 
vitiligo (9, 10). 

Anthrax lethal factor (LF) is the most well- 
characterized trigger for murine NLRPIB (71-15). 
LF directly cleaves NLRPIB close to its N terminus 
(13-15). This cleavage causes N-degron-mediated. 
degradation of the autoinhibitory N-terminal 
fragment, freeing the noncovalently bound 
FIINDY?“-CARD fragment [FIIND, function 
to find domain; CARD, caspase recruitment 
domain; amino acids 1213 to 1474] to activate 
caspase-1 (16-18) (Fig. 1A). The consensus LF 
cleavage site is absent in human NLRPI. As a 
result, LF does not cleave or activate human 
NLRP1. Human NLRP!1 also contains an N- 
terminal pyrin domain (PYD) not found in ro- 
dents (Fig. 1A). Despite these differences, both 
human NLRP1 and rodent homologs can be 
activated by chemical inhibitors of dipeptidases 
DPP8 and DPP9Y, although the underlying mech- 
anisms remain to be fully elucidated (19-21). 
Rodent Nlrp!1 can also be activated by Toxoplasma 
gondii infection in a process that does not 
appear to involve protease-mediated cleavage 
(22, 23). Thus, multiple modes of activation 
exist for NLRP1 from different species. It 
remains unknown whether any naturally 
occurring pathogen-derived molecules can 
activate human NLRPI1. In this study, we set 
out to identify the cognate pathogen-derived 
signal(s) that can activate human NLRP1 and 
assess the mechanisms by which they activate 
the NLRP1 inflammasome. 
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3Cpros activate human NLRP1 in vitro 

From a survey of common human pathogens, 
we selected the human rhinovirus (HRV), which 
is the major causative agent for the common 
cold. HRV is a member of the Picornaviridae 
family of single-stranded RNA viruses, which 
cause a wide range of human diseases, includ- 
ing hand-foot-and-mouth disease, peri- and 
myocarditis, and poliomyelitis (24). HRV in- 
fection of primary human bronchial epithe- 
lial cells can induce caspase-! activation and 
IL-1 secretion (24, 25), although the upstream 
sensing mechanisms are unclear. All picorna- 
viruses, including HRVs, encode two well- 
defined proteases called 2Apro and 3Cpro. 
These proteases are responsible for cleaving 
the viral genomic precursor proteins into in- 
dividual components (26) and have also been 
shown to cleave host proteins to facilitate im- 
mune evasion (27-30). 

To test whether HRV-3Cpro can activate hu- 
man NLRPI, we expressed Myc-tagged HRV- 
3Cpro in a 293T reporter cell line that stably 
expressed ASC-GFP (ASC, apoptosis-associated. 
speck-like protein containing a CARD domain; 
GFP, green fluorescent protein) and NLRPI- 
FLAG (called 293T-ASC-GFP-NLRPI1-FLAG). As 
compared to vector-transfected cells, 3Cpros 
derived from two strains of HRV (HRV-14, 
serotype B, and HRV16, serotype A) and a closely 
related enterovirus (coxsackie B3) induced a 
significant increase in the percentage of cells 
aggregating ASC-GFP specks (Fig. 1B and fig. 
S1, A and B), similar to the dipeptidase in- 
hibitor talabostat. By contrast, none of the 
other viral proteases tested were able to do so 
(Fig. 1B and fig. S1, A and B), despite similar or 
even higher levels of expression. The ability of 
HRV14-3Cpro to activate human NLRP1 was 
entirely dependent on its enzymatic activity, 
because mutating its catalytic cysteine resi- 
due (p.C146A; Cys'*°—Ala) abrogated ASC- 
GFP speck formation (Fig. 1B and fig. SIB). 
In an orthogonal assay for NLRP1 activation, 
HRV14-3CPro and HRV16-3Cpro directly in- 
duced the formation of high-molecular-weight 
NLRP1 oligomers by native polyacrylamide 
gel electrophoresis (PAGE) at the expense of 
monomeric NLRPI1 (fig. S1C). These findings 
establish enteroviral 3Cpros, such as those 
encoded by HRV, as robust activators for the 
reconstituted human NLRP1 inflammasome 
in vitro. 

We and others have recently demonstrated 
that primary and immortalized human kera- 
tinocytes express components of the NLRP1 
inflammasome complex endogenously and can 
undergo rapid pyroptosis in response to in- 
flammasome agonists (7, 19). This provides a 
robust cellular system to determine whether 
3Cpros can activate the endogenous NLRP1 in- 
flammasome. Immortalized human keratino- 
cytes stably expressing GFP-tagged ASC and 
transfected with HRV14-3Cpro formed ASC-GFP 


1 of 12 


RESEARCH | RESEARCH ARTICLE 


A B  293T-ASC-GFP-NLRP1-FLAG C  293T-ASC-GFP 


e vector+HRV14-3Cpro 


100 =» NLRP1-FLAG+HRV14-3Cpro 





Mouse Nirpib 80 
Anthrax LF cleavage Pu 
FIIND auto-cleavage 
| eC -2 60 
| 60 
Functional degradation FIINDY’¢-CARD 
Human NLRP1 


9 40 40 


FIIND auto-cleavage 


+ SE A 7 


FIINDYP*-CARD 


20 


% cells with ASC-GFP specks 


20 


% cells with ASC-GFP specks 








Ro CR ¥ AVAR AR 
@ © Cv oO 
Transfected: VP £ Sra? Lf SOL or MG132 -+ -+ 
NLL VG OND 
D Immortalized human keratinocytes with S 
Tet-ON Myc-HRV14-3Cpro 
sa wk 2 E F 
cx cx cx cx cx @ Untreated 
xm Ke x Ke wo 
aS © © se © © se © m DOX 
PD DW HW HW LW 4 Talabostat t xxx 
DOX - - - + + + = + = 
60 
Talabostat - - - = - = + = ** 
MG132 = = = â = đa = = ka 
ki ™ err 25 
Myc- F P .— — = 
HRV14-3Cpro > 40 
| = -25 T 
ASC & -_--——_«a Á— a È 
> m 
D c 20 
pro-IL-18 © —= =- ae — S — cae cue -°” S 
GAPDH CE ee eee eens a aeee OS S i 
-37 N 
w seen ST N ww 
z & g 
IL-1B 2 . P D ‘ 
— aan < p17 Ce ` 
a ae Fig. 1. Enteroviral 3Cpros activate the human NLRP1 inflammasome. (A) Domain structures 
A ps of human NLRP1 and rodent NLRPIB. Murine NLRPIB is activated by anthrax LF toxin cleavage, 
2 followed by the proteasomal degradation of the autoinhibitory N-terminal fragment. LRR, leucine-rich 
RSG 3 “- repeat. (B) Percentage of 293T-ASC-GFP-NLRP1-FLAG cells with ASC-GFP specks after over- 
= expression of Myc-tagged viral proteases. Cells were fixed 24 hours after transfection of the 
= -50 indicated proteases or empty vector control. Talabostat (2 uM, 24 hours) treatment was used as 
1 2 3 4 5 6 7 8 a positive control. The number of cells with ASC-GFP specks were visually scored with wide- 


field epifluorescence microscopy at 20x magnification. More than 100 cells were scored for 
ASC-GFP speck formation per condition. *P < 0.05, **P < 0.01, ***P < 0.001 [two-way analysis of variance (ANOVA)]; n.s., not significant. Error bars represent SEM 
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specks and underwent membrane perme- 
abilization marked by propidium iodide 
(PI) staining (fig. SID). Using a doxycycline- 
inducible expression system, we showed that 
only keratinocytes expressing active HRV14- 
3Cpro demonstrated cardinal features of pyrop- 
tosis, including the secretion of cleaved, mature 
IL-18 and the formation of detergent-insoluble 
ASC oligomers (Fig. 1D). This occurred despite 
the lower expression of wild-type 3Cpro com- 
pared to its inactive mutant (p.C146A) (Fig. 1D). 
The effect of 3Cpro requires continuous pro- 
teasome activity, as the proteasomal inhibitor 
MG132 significantly blocked HRV-3Cpro- 
induced ASC-GFP speck formation (Fig. 1C) in 
293T-ASC-GFP-NLRP1 cells, as well as NLRP1 
self-oligomerization (fig. SIC) and ASC oligo- 
merization and IL-18 cleavage (Fig. 1D). There- 
fore, 3Cpro-induced human NLRP1 activation 
may also involve a functional degradation 
step that is analogous to LF-induced mouse 
NLREPIB activation. 

HRV14-3Cpro-induced pyroptosis was entirely 
dependent on NLRPI, as its genetic ablation by 
CRISPR-Cas9 (NLRPI'~ Tet-ON HRV14-3Cpro) 
abrogated IL-18 secretion, ASC oligomeriza- 
tion (Fig. LE and fig. S2A), and lytic cell death 
(Fig. IF). In experiments with transiently trans- 
fected HRV14-3Cpro and HRV16-3Cpro, NLRPI, 
ASC, and CASPI deletion all had similar inhibi- 
tory effects (fig. S2, B to D). Therefore, HRV- 
3Cpro can act as a potent trigger of pyroptotic 
cell death in immortalized human keratino- 
cytes by activating the endogenous NLRP1 in- 
flammasome. Similar to reconstituted NLRP1 
in 293T cells (Fig. 1C and fig. S1C), this ef- 
fect required intact proteasome activity (fig. 
S1D). We also observed that a subset of 3Cpro- 
expressing keratinocytes underwent apoptotic 
cell death with annexin V staining but without 
PI inclusion (fig. S2E), in agreement with the 
reported proapoptotic roles of 3Cpros in other 
cell types (31, 32). 


3Cpros cleave human NLRP1 between 
p.Q130 and G131 


Picornaviral 3Cpros, including HRV-3Cpro, 
are cysteine proteases with well-defined cata- 
lytic activity and substrate preferences (26, 33). 
Just as anthrax LF cleaves rodent NLRP1B 
directly (15, 34), we hypothesized that 3Cpros 
could activate human NLRPI via direct cleav- 
age. Overexpressed NLRP1 undergoes auto- 
cleavage within its FIIND and thus appears 
as two bands that differ by ~20 kDa when 
visualized with an N-terminal-specific anti- 
body (35, 36) (Fig. 2, A and B). In the presence 
of HRV14-3Cpro, two additional bands were 
visualized using the same antibody (Fig. 2B). 
By contrast, the C-terminal FIIND’?’-CARD 
fragment remained intact, suggesting that 
the HRV14-3Cpro cleaved NLRPI at a single 
site close to the N terminus and not within the 
FIIND™™-CARD fragment. To visualize the 3Cpro- 
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specific cleavage more precisely, the same ex- 
periment was carried out using the NLRPI??“ 
mutant, which cannot undergo autocleavage 
within the FIIND and thus remained a single 
band by SDS-polyacrylamide gel electropho- 
resis (fig. S3A). With increasing amounts of 
HRV14-3Cpro, NLRP1*??!”“ became cleaved 
into a single proteolytic product, which was 
~20 kDa smaller than full-length NLRPI (fig. 
S3A). In both experiments, the proteolytic band- 
ing patterns could be explained by a single 
cleavage site ~20 kDa from the NLRPI N ter- 
minus. The same cleavage could be observed 
when NLRP1-expressing cell-free lysate was 
incubated with recombinant HRV14-3Cpro 
at 33°C (the preferred temperature for HRV 
infection) (fig. S3B). On the basis of these ob- 
servations, the 3Cpro cleavage site was mapped 
to the linker region immediately after the PYD 
(Fig. 2A), a region that is not conserved in ro- 
dents. As 3Cpros require a glutamine residue at 
the P,’ substrate site (33), each of the 11 gluta- 
mine residues in this linker region was changed 
to alanine by site-directed mutagenesis. Of all 
the mutants tested, only the p.Q130A muta- 
tion abrogated NLRP1 cleavage by HRV14-3Cpro 
(Fig. 2C) and other 3CPros (fig. S3C). We next 
generated reporter 293T-ASC-GFP cells that 
stably expressed either wild-type NLRP1 or the 
uncleavable Q130A mutant (NLRP1%2) (Fig. 
2D). In contrast to wild-type NLRP1, NLRP12??™- 
expressing cells no longer nucleated ASC-GFP 
specks in response to HRV14-3Cpro overex- 
pression. However, their response to talabo- 
stat remained the same (Fig. 2E). Similar results 
were obtained using transfected NLRP1%2°“ 
and HRV16-3Cpro (fig. S3D). 

To study the endogenous inflammasome re- 
sponse, we rescued human NLRPI ^ keratino- 
cytes with either wild-type NLRP1 or the cleavage 
site mutant NLRP1°?™. Wild-type NLRP1, 
but not NLRP1°%?™. restored HRV14-3Cpro- 
triggered IL-18 secretion, ASC oligomeriza- 
tion (Fig. 2F and fig. S4, A and B), and lytic 
cell death (Fig. 2G and fig. S2C). By contrast, 
both wild-type NLRP1 and NLRP12°™ enabled 
NLRPT ^ cells to respond to talabostat (Fig. 2, 
F and G, and fig. S4A). Thus, 3Cpro-triggered 
NLRP1 activation is a result of a discrete cleav- 
age site between amino acids Q130 and G13], 
adjacent to the human-specific PYD. This site 
is not conserved in rodents, and as expected, 
HRV14-3Cpro did not induce pyroptotic cell 
death in murine RAW264.7 cells (fig. S4, C to 
E). This is consistent with the fact that HRV, 
like most pathogenic human enteroviruses, 
does not naturally infect rodents (37). 


Degradation of the 3Cpro cleavage fragment 
(amino acids 131 to 1212) is required for 
NLRP1 activation 


We next sought to further dissect how the 
cleavage of human NLRPI1 by 3Cpros triggers 
proteasome-dependent inflammasome activa- 
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tion. Cleavage of human NLRP1 by HRV-3Cpro 
removes the entire PYD, where most disease- 
causing gain-of-function germline mutations 
occur (Fig. 1A) (6, 7, 38, 39). It is conceivable 
that 3Cpro cleavage relieves an intrinsic auto- 
inhibitory effect of the noncanonical NLRP1 
PYD. Alternatively, the cleavage could trigger 
the destabilization of the entire N-terminal frag- 
ment (between the 3Cpro cleavage site and the 
FIIND autoproteolysis site, amino acids 131 to 
1212). To distinguish between these two pos- 
sibilities, we examined a truncation mutant of 
NLRPI (amino acids 131 to 1474), which mimics 
the major product generated by HRV14-3Cpro 
cleavage except for the initiating methionine. 
Unexpectedly, this mutant did not cause in- 
creased ASC-GFP speck formation relative to 
wild-type NLRP1 in 293T-ASC-GFP cells, despite 
being fully sensitive to talabostat-mediated ac- 
tivation (Fig. 3A and fig. S5A). Thus, the re- 
moval of NLRP1 PYD appears to be insufficient 
by itself to account for HRV-3Cpro-triggered 
NLRP1 activation. 

Recently, the mechanism by which anthrax LF 
activates NLRPIB has been reported (6, 17, 40). 
LF cleavage creates an N-terminal degron that 
is recognized by the type II N-degron recep- 
tors, such as UBR2, which degrade the inhibi- 
tory NLRPIB N-terminal fragment via the 
proteasome (Fig. 3B). We therefore hypothe- 
sized that an analogous pathway may account 
for the ability of HRV-3Cpro to activate human 
NLRPI. However, HRV-3Cpro cleavage of hu- 
man NLRPI generates an N-terminal glycine, 
which is not a canonical type II N-terminal 
degron recognized by related UBR proteins 
(41, 42). A glycine-specific N-degron pathway 
has recently been described (43), which con- 
sists of receptors ZER1 and ZYGI1I1B and their 
partner cullins CUL2 and CULS5 (called 
cullin2Y2"CUB) The cullin®/“"C"® machinery 
ubiquitinates substrate proteins with N- 
terminal glycine residues and causes their 
degradation via the proteasome. In contrast to 
the UBR system, the N-glycine degron path- 
way is not sensitive to type II free amino acids 
but can be inhibited by the NEDD8-cullin 
inhibitor MLN4924 (43) (Fig. 3B). In 293T- 
ASC-GFP-NLRPI reporter cells, MLN4924 
blocked HRV14-3Cpro-induced ASC-GFP speck 
formation (Fig. 3C) in a manner comparable to 
proteasomal inhibitors MG132 and bortezomib. 
MLN4924 similarly blocked HRV14-3Cpro 
NLRP!1 self-oligomerization (Fig. 3D) in 293T- 
NLRP1-FLAG cells. Notably, MLN4924 stabilized. 
the postcleavage NLRPI fragment (corre- 
sponding to amino acids 131 to 1212, taking 
into account FIIND autoproteolysis) equiva- 
lently to MG132 and bortezomib (Fig. 3D). 
Type II N-degron inhibitor phenylalanine had 
no effect on either ASC-GFP speck formation 
or NLRP1 self-oligomerization (Fig. 3, C and 
D). Similarly, in keratinocytes, both MLN4924 
and bortezomib completely blocked mature 
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Fig. 2. 3CPros activate NLRP1 by direct cleavage at a single site between p.Glul130 and Gly131. (A) HRV14- 
3Cpro cleaves NLRPI close to its N terminus. Schematic shows full-length NLRP1 and the antibodies used to 
detect the NLRP1 autoproteolytic fragments. The epitope of the N-terminal NLRP1 antibody is between NLRP1 amino 
acids 130 and 230. Single-letter abbreviations for the amino acid residues are as follows: A, Ala; C, Cys; D, Asp; 

E, Glu; F, Phe; G, Gly; H, His; I, lle; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; 
W, Trp; and Y, Tyr. (B) 293T cells were transfected with C-terminal HA-tagged NLRP1 and Myc-tagged HRV14-3Pro. 
Full-length NLRP1 and its cleavage products were visualized with the N-terminal fragment-specific NLRP1 antibody and 
an antibody against the C-terminal HA tag. Red arrows indicate the proposed proteolytic relationship between the 
observed NLRP1 fragments. Note that the presence of catalytically active 3Cpro decreased the expression of all 
transfected plasmids (see next panel). (C) To map the 3Cpro cleavage site, 293T cells were cotransfected with the 
indicated NLRP1 alanine mutants and HRV14-3Cpro. 3Cpro®!S^ was used as a negative control. Total cell lysates were 
harvested 48 hours after transfection and analyzed by SDS-PAGE and immunoblotting. (D) Q130A abrogates 
HRV14-3Cpro cleavage. 293T-ASC-GFP-NLRP1”'-FLAG and 293T-ASC-GFP-NLRP12"°"-FLAG cells were transfected 
with Myc-HRV14-3Cpro or treated with talabostat for 48 hours. Total cell lysates were analyzed by SDS-PAGE. The 
asterisk symbol indicates nonspecific NLRP1 degradation product. Red arrow, 3Cpro-dependent cleavage product 
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number of cells with ASC-GFP specks were visually scored with wide-field epifluorescence microscopy at 20x 

magnification. **P < 0.01; ****P < 0.0001 (two-way ANOVA). n = 3 independent transfections per drug treatment. Error bars represent SEM. (F) Representative immunoblots 
demonstrate that wild-type NLRP1, but not NLRP1°°" restores 3Cpro-dependent IL-18 secretion and ASC oligomerization. (G) Trypan blue exclusion assay 
demonstrates that wild-type NLRP1, but not NLRP1°8°" restores 3Cpro-triggered lytic cell death in NLRPI-’~ human keratinocytes. ****P < 0.0001 (two-way 
ANOVA). n = 3 independent transfections per drug treatment. Error bars represent SEM. 
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IL-18 secretion and ASC oligomerization upon 
HRV14-3Cpro induction (Fig. 3E and fig. S5C) 
and stabilized the postcleavage NLRP1 frag- 
ment (fig. S5C). MLN4924 had a much smaller 


ZYG11B 


Fig. 3. 3Cpro-triggered A 
NLRP1 activation re- 
quires the N-terminal 
glycine degron pathway. 
(A) Overexpressed NLRP1 
(amino acids 131 to 1474) 
does not cause spontaneous 
ASC-GFP. speck formation. 
293T-ASC-GFP cells were 
transfected with wild-type 
NLRP1 or mutants and 
fixed 48 hours after trans- 
fection. ****P < 0.0001 
(two-way ANOVA). n = 

3 independent transfec- 
tions per drug treatment. 
Error bars represent SEM. 
(B) Summary of the dis- 
tinct pathways regulating 
postcleavage N-terminal 
fragment degradation in 
human NLRP1 and murine 
NLRP1B. The two types 

of N-degron pathway re- 
quire distinct recognition 
receptors and demonstrate 
distinct sensitivities to 
small-molecule inhibitors. 
(C) MLN4924 and protea- 
somal inhibitors abrogate 
3Cpro-induced ASC-GFP 
specks. 293T-ASC-GFP- 
NLRP1-FLAG cells were 
transfected with HRV14- 
3Cpro and treated with 
the indicated drugs 
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SEM. (D) MLN4924 and 

bortezomib inhibit NLRP1 self-oligomerization and stabilize the 3Cpro cleavage 
fragment. 293T-NLRP1-FLAG cells were transfected with HRV14-3Cpro and 
treated with the indicated drugs 16 hours after transfection for another 

24 hours. Lysates were analyzed by SDS-PAGE and immunoblot. The red 
arrowhead indicates the HRV14-3Cpro cleavage product (amino acids 131 to 
1212). (E) IL-18 ELISA of conditioned media. ****P < 0.0001, one-way ANOVA. 
n = 3 independent doxycycline inductions per drug treatment. Error bars 
represent SEM. (F) CullinZ®®”4"C48 but not UBR2, is genetically required for 
3Cpro-induced NLRP1 activation. The indicated CRISPR KO 293T-ASC-GFP 
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effect on talabostat-induced IL-1B secretion in 
keratinocytes (fig. S5B). Thus, the cullin@®*=” 
system does not appear to fully account 
for talabostat-mediated NLRP1 activation. 
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Given the known pleiotropic effects of 
MLN4924, we used CRISPR-Cas9 to delete 
ZERI1 and ZYGUB (called ZZ-dKO; dKO, dou- 
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cells were transfected with wild-type NLRP1 + HRV14-3Cpro (or vector) or with 
NLRP1™”’". Cells were fixed 24 hours after transfection. ASC-GFP specks 
were scored using wide-field microscopy at 20x. ****P < 0.0001, one-way ANOVA. 
n = 3 transfections, >100 cells. Error bars represent SEM. (G) Cullin@&®!/2¥GU8 
but not UBR2, is responsible for the degradation of the 3Cpro cleavage 
product. CRISPR KO 293T-ASC-GFP cells stably expressing NLRP1-FLAG were 
transfected with HRV14-3Cpro. Cell lysates were harvested 48 hours after 
transfection. The red arrowhead indicates the 3Cpro cleavage product (amino 
acids 131 to 1212). 
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(data S1). The percentage of ASC-GFP specks 
in clonal ZZ-dKO 293T-ASC-GFP cells was sig- 
nificantly reduced after coexpression of wild- 
type NLRP1 and HRV14-3Cpro. By contrast, 
UBRI ^ cells were not significantly different 
from Cas9 control 293T-ASC-GFP cells (Fig. 
3F and fig. S5E). As an additional control, we 
examined the patient-derived gain-of-function 
NLRPI1 mutation, p.M77T, which by itself causes 
the destabilization of the N-terminal fragment 
(fig. S5A). Indeed, the percentage of ASC-GFP 
speck-forming cells induced by NLRP1™”? 
did not differ among ZZ-dKO, UBR2~-, and 
control cells (Fig. 3F and fig. S5E). Therefore, 
cullin@®8”2"C" is specifically required for 
HRV14-3Cpro-triggered human NLRP1 acti- 
vation. The 3Cpro-cleaved NLRP1 fragment 
(amino acids 131 to 1212) was significantly 
stabilized in ZZ-dKO cells relative to control 
cells (Fig. 3G) but not in UBR2~~ cells (fig. S5D). 
Thus, the 3Cpro-cleaved NLRP1 N-terminal 
fragment is a substrate for the cullin@®”“"C"™®- 
mediated N-terminal glycine degron pathway. 


Live HRV infection activates the NLRP1 
inflammasome in primary airway epithelial cells 


We next tested the effect of live HRV infection 
in disease-relevant human epithelial cell types. 
In HeLa-Ohio cells overexpressing NLRPI1- 
hemagglutinin tag (HA), robust HRVI16 viral 
replication was achieved 16 hours after inocu- 
lation [multiplicity of infection (MOI) = 1], as 
evidenced by the accumulation of the viral 
capsid protein VP2 (fig. S6A, lanes 4, 5, 10, 
and 11). Viral replication was accompanied 
by the appearance of the ~120-kDa NLRPI1 
cleavage product in infected cells (fig. S6A). 
The small-molecule pan-HRV-3Cpro inhibi- 
tor rupintrivir completely abrogated NLRP1 
cleavage (fig. S6A). Using HeLa-Ohio cells ex- 
pressing ASC-GFP, we tested whether HRV16 
infection could induce NLRP1 inflammasome 
complex assembly. Although HeLa-Ohio cells 
demonstrated less NLRP1 inflammasome acti- 
vation than keratinocytes or HEK293T cells, 
HRVI16 and talabostat caused significant ASC- 
GFP oligomerization (fig. S6B). These effects 
were only observed in HeLa-Ohio-ASC-GFP 
cells expressing wild-type NLRP1, not in cells 
expressing the cleavage-resistant NLRP1%?°“ 
mutant (fig. S6B). Thus, live HRV16 infec- 
tion causes 3Cpro-dependent NLRP!1 cleav- 
age and activates the reconstituted human 
NLRP1 inflammasome. 

HRV is one of the most common human viral 
pathogens responsible for respiratory tract in- 
fections. Human airway epithelial cells (AECs) 
are known to endogenously express multiple 
double-stranded RNA sensors such as TLR3, 
MDAS, and RIG-I, which all participate in 
antiviral defense. However, the repertoire of 
endogenous inflammasome components ex- 
pressed in these cells has not been fully 
characterized. By examining published RNA 
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sequencing (RNA-seq) datasets, we found that 
primary AECs (44-48) express a very restricted 
repertoire of NLR sensors (fig. S6C). Known 
inflammasome sensors such as MEFV, NLRP3, 
NLRC5, and AIM2 were expressed at low levels 
or were undetectable (fig. S6E). We confirmed 
that NLRP3 was not expressed in primary 
human bronchial epithelial cells [transcripts 
per million (TPM) < 1] (Fig. 4A and data 82). 
NLRP3 was also undetectable in primary hu- 
man nasal epithelium by RNA in situ staining 
(fig. S6D). The lack of NLRP3 inflammasome 
in primary normal human bronchial epithelial 
cells (NHBEs) was recently confirmed by an 
independent study (49). Although we cannot 
rule out that NLRP3 mRNA may be transcrip- 
tionally induced under specific conditions, 
these results demonstrate that NLRP1, but not 
NLRP3, is the predominant inflammasome 
sensor constitutively expressed in human AECs. 
In addition to NLRPI, all AECs endogenously 
expressed PYCARD (ASC), CASPI, and the IL-1- 
family cytokines ZLIB and IL18 (Fig. 4A, data S2, 
and fig. S6C), suggesting that human AECs are 
capable of NLRP1-mediated inflammasome 
activation. 

We next profiled the endogenous cytokine 
and chemokine response of NHBEs to live 
HRVI16 infection (Fig. 4B). IL-18, whose secre- 
tion strictly depends on inflammasome acti- 
vation, was the most highly induced cytokine 
in both HRV16-infected and talabostat-treated 
NHBEs (Fig. 4, C and D). IL-1B was also sig- 
nificantly secreted from HRV16-infected and 
talabostat-treated NHBEs, but its levels were 
lower and more variable than those of IL-18 
(Fig. 4, E and F). This variability is likely due to 
the transcriptional regulation of ILIB mRNA. 
Similar results were obtained with primary nasal 
epithelial cells (fig. S6E). In agreement with 
the lack of endogenous NLRP3 in NHBEs, 
neither IL-18 nor IL-18 secretion was affected 
by the NLRP3 inhibitor MCC950 after HRV16 
infection or talabostat treatment. By contrast, 
the 3Cpro inhibitor rupintrivir completely ab- 
rogated HRV16-induced IL-18 and IL-18 secre- 
tion (Fig. 4, D and E). HRV16-infected NHBEs 
demonstrated cardinal features of inflamma- 
some activation, including: (i) proteolytic pro- 
cessing of IL-18 and IL-18 into their p17 mature 
forms (Fig. 4F); (ii) endogenous ASC oligomer- 
ization (Fig. 4F); (iii) caspase-1 activation (fig. 
S7A); (iv) release of intact LDH activity (fig. 
S7B); and (v) characteristic membrane “balloon- 
ing” (fig. S7C). We then performed apical HRV16 
infection of three-dimensional (3D) air-lifted 
human bronchial epithelium cultures and con- 
firmed that this serotype induced the secre- 
tion of IL-18 (Fig. 5, A and B). Histologically, 
both HRVI16 infection and talabostat treat- 
ment induced mucus cell metaplasia by Alcian 
blue and periodic acid-Schiff (AB-PAS) stain- 
ing and gasdermin D (GSDMD) cleavage (Fig. 
5C). Thus, HRV16 infection activates the NLRP1 
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but not NLRP3 inflammasome in primary 
NHBEs. 

We next sought to further validate NLRP1 
as the endogenous inflammasome sensor for 
HRV. CRISPR-Cas9-mediated deletion of NLRPI, 
ASC, or pro-caspase-1 in primary NHBEs com- 
pletely eliminated IL-18 cleavage caused by 
HRVI16 infection or talabostat treatment (Fig. 
6A and fig. S7, D and E). Furthermore, HRV16- 
infected NLRPI knockout NHBEs were un- 
able to cleave IL-18 into its mature p17 form 
(Fig. 6B) or assemble ASC into high-molecular- 
weight oligomers (Fig. 6B). This lack of inflam- 
masome activation in KO cells was not caused 
by a general defect in viral biogenesis, as the 
maturation of capsid protein VP2, which itself 
is dependent on 3Cpro activity (50), was un- 
affected in NLRPI'~ or ASC“ cells (Fig. 6B). 
These results provide further support that 
NLRP1 is the primary inflammasome sensor 
for HRV infection in human NHBEs. HRV- 
triggered inflammasome activation likely oc- 
curs in parallel with other immune sensing 
pathways, as NLRPI’~, ASC’, and CASPT’~ 
NHBEs were still capable of undergoing cell 
death and producing other cytokines, such as 
IL-8 (fig. S7, F and G), upon HRV16 infection. 

To confirm that HRV-induced NLRPI acti- 
vation requires the cleavage between residues 
Q130 and G131, we infected NLRPI ^ NHBEs 
rescued with either FLAG-tagged wild-type 
NLRP1 or NLRPI®^, As observed previously 
with keratinocytes (Fig. 2, D to G), both wild- 
type NLRP1 and NLRP1%?™ rescued talabostat- 
dependent IL-18 secretion in NLRPT’” NHBEs 
(Fig. 6C and fig. S8B). However, only wild-type 
NLREPI, and not the uncleavable mutant Q130A, 
rescued HRV16-induced IL-18 secretion (Fig. 
6C), despite similar levels of VP2 maturation. 
In addition, HRV16 induced strong degrada- 
tion of the NLRP1 N-terminal fragment only 
in wild-type NLRP1 rescued NHBEs, but not 
NLRPI2?-expressing cells (Fig. 6D). MLN4924 
and bortezomib completely blocked HRV16- 
dependent IL-1 and IL-18 secretion in NHBEs 
(Fig. GE and fig. S8A). MLN4924 pretreatment 
also stabilized the NLRP1 N-terminal cleavage 
fragments in HRV16-infected NHBEs express- 
ing wild-type NLRP1 (Fig. 6G) without affect- 
ing the accumulation (Fig. 6F) of the activating 
C-terminal fragment of NLRP1 (Fig. 6G). Thus, 
both cullin“*”’“C"® and the proteasome are 
necessary for HRV-triggered NLRP1 inflamma- 
some activation in NHBEs. 

Here, we report that enteroviral 3C proteases, 
such as HRV-3Cpro, activate the human inflam- 
masome sensor NLRPI via direct cleavage at a 
single site between amino acids Q130 and G131. 
This finding not only provides a unified mech- 
anism for proteolysis-mediated activation of 
NLRP1 inflammasome in humans and rodents 
(16-18) but also reveals an unexpected role for 
the recently described N-terminal glycine 
degron pathway in human innate immunity. 
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Fig. 4. HRV infection acti- A 
vates inflammasome-driven 
pyroptosis in NHBEs indepen- 
dently of NLRP3. (A) NLRP1, 80 
NLRP3, PYCARD (ASC), and 
CASP1 mRNA expression (RNA- 
seq TPM) in NHBEs. n = 11, 
independent cultures of NHBEs. 
****P < 0.0001, two-tailed 
Student's t test. Error bars 
represent SEM. (B) Overview of 
cytokine profiling of HRV16- 
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(F) Representative immunoblot demonstrates the secretion of cleaved IL-18 and IL-18 and ASC oligomerization in HRV16-infected primary NHBEs. Cells were 
inoculated with HRV16 (MOI = 5) or treated with talabostat (2 uM) for 48 hours at 33°C. 


Mechanistically, 3Cpro-cleavage leaves a gly- 
cine residue (p.G131) on the NLRP1 fragment, 
which is recognized by the cullin "ZYCH N- 
glycine degron machinery and is subsequently 
degraded by the proteasome (Fig. 3B). This 
constitutes a key step in unleashing the NLRP1 
FIIND’?“-CARD fragment to fully assemble 
the inflammasome complex consisting of ASC 
and caspase-1. It is noteworthy that a single 
cleavage site mutation (Q130A) can completely 
block 3Cpros from activating the NLRP1 inflam- 
masome without affecting talabostat-triggered 
NLRP!1 activation. This suggests that NLRP1 
can undergo multiple modes of activation. 
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The identification of enteroviral 3Cpros as a 
pathogen-derived trigger for human NLRPI1 
also sheds light on the evolutionary trajectory 
of NLRPI1. The 3Cpro cleavage site arose in the 
common ancestor for simians and is absent in 
prosimians such as tarsiers and lemurs (fig. S9). 
It is conceivable that the more recently evolved, 
3Cpro-responsive NLRPI allele has provided a 
selective survival advantage during the evolution 
of simian primates, including humans, presum- 
ably to sense and mount an appropriate im- 
mune response against certain simian-tropic 
enteroviral pathogens. Nevertheless, our work 
does not preclude the possibility that human 
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NLRP1 can detect other pathogen- or danger- 
derived signals besides 3Cpros. 

Our findings establish human NLRP1 as part 
of the repertoire of viral sensors in the airway 
epithelium and challenge the widely held no- 
tion that viral proteases largely serve to disable 
host immune sensing. As the inflammasome 
pathway does not require de novo protein syn- 
thesis, it is particularly well suited as a fail-safe, 
second-line defense. The inflammasome can 
commit the infected cells to pyroptosis and 
release IL-18 even after host transcription and 
translation have been shut off by viral viru- 
lence factors, thereby alerting other cell types 
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Fig. 5. HRV infection leads to inflammasome activation and IL-18 secretion in 3D bronchial epithelial cultures. (A) Overview of apical HRV infection in 3D human 
bronchial epithelial cultures. (B) IL-18 levels in apical or basal media 48 hours after infection or talabostat treatment. **P < 0.01, ***P < 0.001, ****P < 0.0001 (two-way 
ANOVA). n = 3 independent infected or treated cultures. Error bars represent SEM. (C) Representative Alcian blue and periodic acid—Schiff (AB-PAS) staining (top) 


and immunohistochemical staining for cleaved (N-terminal) GSDMD (bottom) of HRV16-infected (MOI 


= 3) and talabostat (2 uM)-treated 3D human bronchial epithelial 


cultures. Staining performed on two sections from three independent 3D bronchial epithelial cultures. Scale bar, 50 um. 


via paracrine signaling. (57). In keeping with 
this notion, NLRPI knockout did not lead to 
an appreciable increase in infectious viruses 
48 hours after inoculation (fig. S8C). We pro- 
pose that the NLRP1 inflammasome plays 
an important role in antiviral immunity and 
inflammation in the human airway epithe- 
lium. Respiratory viral infections, including 
HRVs, are well-known risk factors for the 
exacerbation of asthma and chronic obstruc- 
tive pulmonary disease (52). Therefore, the 
NLREP1 inflammasome pathway might serve as 
a potential therapeutic target to treat these 
diseases and their complications. 


Materials and methods 
Cell culture and chemicals 


293Ts (ATCC #CRL-3216), HeLa-Ohio (ECACC 
General Cell Collection #84121901), and nor- 
mal bronchial epithelial cells (NHBE, Lonza 
#CC-2541) were cultured according to the sup- 
pliers’ protocols with modifications (53) Immor- 
talized human keratinocytes (N/TERT-1, or N/ 
TERT herein) were provided by H. Reinwald 
(MTA) (54). Primary nasal epithelial cells were 
obtained from nasal epithelial biopsies obtained 
from adult patients scheduled for rhinoplasty 
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and obtained with informed consent at Na- 
tional University Hospital (NUH), Singapore. 
Fresh nasal epithelial tissues were used to 
derive nasal cells as previously described (55). 
All cell lines underwent routine Mycoplasma 
testing with Lonza MycoAlert (Lonza #LT07- 
118). The following drugs and chemicals were 
used as part of this study: MCC950 (MCE, #HY- 
12815), bortezomib, and MLN4924 (provided 
by D. Lane; p53, A*STAR, Singapore), rupintrivir 
(Sigma, #PZ0315), talabostat (MCE, #HY-13233), 
MG132 (MCE, #HY13259), A83-01 (MCE, #HY- 
10432A), Y-27632 (MCE, #HY-10583), isoprena- 
line (MCE, #HY-B0468). 


Plasmid transfection and stable cell line 
generation using lentiviruses 


293T-ASC-GFP, N/TERT-ASC-GFP, and N/TERT 
NLRPI-KO cells were described previously (19). 
All transient expression plasmids were cloned 
into the pCS2+ vector using standard restriction 
cloning using ClaIl and XhoI flanking the open 
reading frames. P site-directed mutagenesis 
was carried out with QuickChangeXL II (Agilent 
#200522). Constitutive lentiviral expression was 
performed using pCDH vectors (SystemBio). 
Doxycycline-inducible Tet-ON lentiviral con- 
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structs were based on the pI'RIPZ backbone 
(ThermoFisher Scientific). 


Antibodies, staining reagents, 
and cytokine analysis 


The following antibodies were used in this 
study: c-Myc (Santa Cruz Biotechnology, #sc-40), 
HA tag (Santa Cruz Biotechnology, #sc-805), 
GAPDH (Santa Cruz Biotechnology, #sc-47724), 
ASC (Adipogen, #AL-177), CASP1 (Santa Cruz 
Biotechnology, #sc-622), ILIB (R&D systems, 
#AF-201), FLAG (SigmaAldrich, #F3165), GFP 
(Abcam, #ab290), NLRPI1 (R&D systems, #AF6788), 
IL18 (Abcam ab207324), cleaved GSDMD-NT 
(Asp””) (Cell Signaling Technology, #36425), and 
HRV-VP2 (QED Bioscience, #18758). HRV16- 
3Cpro was detected by rabbit serum provided 
by A. Palmenberg and J. Gern (University of 
Wisconsin). All horseradish peroxidase (HRP)- 
conjugated secondary antibodies were pur- 
chased from Jackson Immunoresearch (goat 
anti-mouse IgG: 115-035-166; goat anti-rabbit 
IgG: 111-035-144; and donkey anti-goat IgG: 
705-005-147). The protocol for blue native PAGE 
and disuccinimidyl suberate (DSS) cross-linking 
were performed as previously described (7). For 
AB-PAS staining and immunohistochemistry, 
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Fig. 6. NLRP1 cleavage and the N-glycine degron pathway are indispensable for 
HRV-dependent inflammasome activation in NHBEs. (A) Endogenous NLRP1, ASC, 
and caspase-l are indispensable for HRV16-induced IL-18 secretion in NHBEs. Cas9- 
control, NLRPI’”, ASC”, and CASPI-’” NHBEs were infected with HRV16 (MOI = 5). 
Conditioned media were harvested 48 hours after infection. *P < 0.05, **P < 0.001 
(two-way ANOVA). n = 3 independent live viral infections per drug treatment. Error bars 
represent SEM. (B) NLRPI is genetically required for HRV16-induced IL-18 cleavage and 
ASC oligomerization. Cas9 control, NLRPI’~, and ASC’ NHBEs were infected with 
HRV16, as before. Cleaved IL-18 are marked with arrows. (C) Mutating the NLRP1 
cleavage site abrogates HRV16-triggered IL-18 secretion in NHBEs. NLRPI’~ NHBEs 
were rescued with lentiviral constructs expressing FLAG-tagged NLRP1™" or 
NLRP12° which carried silent mutations at the PAM site. The rescued cells were 
infected with HRV16 or treated with talabostat. Conditioned media was analyzed by 


Robinson et al., Science 370, eaay2002 (2020) 4, December 2020 


IL-18 ELISA. **P < 0.01, ****P < 0.0001 (two-way ANOVA), n = 3 replicates from one 
of two independent infections. Error bars represent SEM. (D) HRV infection leads to 
NLRP1 cleavage and degradation of NLRP1 N-terminal fragments. NLRPI’~ or 
NLRP1'- or NLRP1°°°*-rescued cells were infected with HRV16 or treated with 
talabostat. Cells were harvested 48 hours after infection. (E) MLN4924 and bortezomib 
block HRV16-induced IL-18 secretion. NHBEs were pretreated with MLN4924 (1 uM) or 
bortezomib (1 uM) before HRV16 infection or talabostat treatment. IL-18 levels were 
measured with ELISA. ***P < 0.001 (two-way ANOVA), n = 3 independent infections 
per treatment. Error bars represent SEM. (F) MLN4924 does not affect 3Cpro- 
dependent VP2 maturation in the course of HRV16 infection. Lysates from HRV16- 
infected NHBEs in Fig. 5F were analyzed by immunoblotting 48 hours after HRV1 
inoculation. (G) MLN4924 stabilizes the 3Cpro cleavage fragment (amino acids 131 to 
1212). NHBEs were pretreated with MLN4924 and infected with HRV as in (D). 
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5-um formalin-fixed paraffin-embedded 3D 
bronchial epithelial sections were deparaffi- 
nized in xylenes and rehydrated through a 
series of alcohol before being hydrated in 
water. AB-PAS staining was then carried out 
using a staining kit (Abacm, ab245876) accord- 
ing to manufacturer’s instructions. For immuno- 
histochemistry staining to visualize cleaved 
GSDMD-N terminal fragment, a previously 
published protocol was used (56). For trypan 
blue staining, cells were incubated with trypan 
blue stain (ThermoFisher Scientific, #15250061), 
and the percentage of positively stained cells 
versus total cell count calculated. Three cell 
counts were performed per condition. For 
RNAscope, primary human nasal epithelium 
sections (normal) were obtained through the 
National University of Singapore (NUS/NUH, 
Singapore) with informed consent. Visual- 
ization of NLRP1 and NLRP3 RNA was per- 
formed according to manufacturer’s instructions 
and recommendations (RNAscope 2.5 HD- 
Red, ACDbio). 

Cytokine and chemokine measurements 
were carried out with human IL-18 enzyme- 
linked immunosorbent assay (ELISA) kit (BD, 
#557953), human IL-18 ELISA kit (MBL Bio- 
science, #7620), and Immune Monitoring 65- 
Plex Human ProcartaPlex Panel (ThermoFisher 
Scientific, EPX650-10065-901) according to 
the manufacturer’s protocol. All primary anti- 
bodies were used at 250 ng/ml. 


Immunoblotting 


For whole cell lysates, cells were resuspended 
in tris-buffered saline 1% NP-40 with prote- 
ase inhibitors (ThermoFisher Scientific, #78430). 
Protein concentration was determined using 
the Bradford assay (ThermoFisher Scientific, 
#23200) and 20 ug of protein loaded, unless 
stated otherwise. For analysis of IL-1B and 
IL-18 cleavage in the media by immunoblot- 
ting, samples were 10 times-concentrated 
using filtered centrifugation (Merck, Amicon 
Ultra, #UFC5003BIK). Protein samples were run 
using SDS-PAGE system (Bio-Rad), immuno- 
blotted, and then visualized using x-ray film de- 
tector or ChemiDoc Imaging system (Bio-Rad). 


Microscopy 


Images of ASC-GFP specs were acquired in three 
random fields in 4’,6-diamidino-2-phenylindole 
(DAPI) (358 nm/461 nm) and GFP (469 nm/ 
525 nm) channels using EVOS microscope 
(FL Auto M5000, ThermoFisher Scientific, 
#AMF5000) with a EVOS 20x Objective (ach- 
romat, LWD, phase-contrast, 0.40NA/6.92WD, 
ThermoFisher Scientific, #AMEP4934) and 
built-in camera (Sony IMX265 monochrome 
CMOS) according to manufacturer’s protocol 
(EVOS M5000 Manual, ThermoFisher Scientific 
Publication Number MANO017563). Counting 
of the number of ASC-GFP specks was per- 
formed using ImageJ Fiji “watershed” and 
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“find maxima” built-in functions. Briefly, all 
DAPI images were subjected to thresholding 
to distinguish background from foreground 
to give a binary image. The watershed algo- 
rithm was used to separate touching objects, 
before segmented objects could be analyzed 
and counted to give a total nuclei count. The 
number of GFP-ASC specks was calculated by 
“find maxima” function and noise tolerance 
set accordingly to count the total number 
of specks per image. The number of specks 
was then calculated as a percentage of total 
nuclei in three representative fields of view. 
Brightfield images of stained sections were 
taken at 40x magnification using a widefield 
microscope (Zeiss AxioImager Z.2, Zeiss) 
with a 40x oil immersion objective (EC Plan 
Neofluar Antiflex, Zeiss). Images were acquired 
with the built-in camera (Axio Cam 506 mono 
1.2 Mbps, Zeiss). 


HRVI16 virus propagation 


HRV used in the study was HRV-A16 (strain 
11757; ATCC VR-283, Manassas, VA, USA) and was 
propagated in HeLa cell line (HeLa Ohio, ECACC 
84121901, Porton Down, Salisbury, Wiltshire, 
UK). HeLa cells were grown in Eagle’s Minimum 
Essential Medium (EMEM) ATCC 30-2003, 
supplemented with 10% fetal bovine serum 
(FBS) (BioWest, Kansas City, MO, USA), 
2% HEPES, and 1% Antibiotic-Antimycotic (Anti- 
Anti) (Gibco) and incubated at 37°C in a humid- 
ified incubator with 5% CO». To propagate 
HRV16, HeLa cells were first seeded to achieve 
80 to 90% confluence in 24-well plates overnight. 
Cells were rinsed with LX Dulbecco’s phosphate- 
buffered saline (DPBS) and infected with HRV16 
before addition of EMEM with 2% FBS, 
2% HEPES, and 1% Anti-Anti. Infected HeLa cells 
were incubated at 33°C for 2 to 3 days. Viruses 
were harvested from the supernatants of in- 
fected HeLa cells when ~80% cytopathic effects 
(CPE) were observed. HRV stocks were cen- 
trifuged at 1500g for 10 min at 4°C to remove 
cellular debris and aliquoted into cryovials 
for storage at —80°C. 


Inoculation of human rhinovirus 


HRV was diluted using the respective cell cul- 
ture medium and inoculated at MOI of 5.0 
(NHBE) and 1.0 (HeLa), respectively. Infected 
cells were incubated at 33°C for 1 hour. Con- 
ditioned noninfected cell culture medium from 
viral propagation was added as an uninfected 
control. The HRV-infected and control cells 
were then incubated at 33°C for up to 48 hours 
post-infection (hpi). Cell culture supernatant 
and cell lysate were collected to perform rele- 
vant assays between 24 and 72 hpi. 


Viral quantification using rhinovirus plaque assay 


HeLa cells (at 85 to 95% confluence) in 24-well 
plates were incubated with 100 ul of serial 
dilutions from 107 to 10° of virus-containing 
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conditioned media at 33°C for 1 hour. The 
plates were rocked every 15 min to ensure 
equal distribution of virus. The inoculum was 
removed and replaced with 1 ml of Avicel (FMC 
Biopolymer) overlay to each well and incubated 
at 33°C for 65 to 72 hours. The overlay compo- 
nents were optimized to obtain HRV plaques 
suitable for counting. Avicel powder was added 
into double-strength MEM to formulate 1.2% 
Avicel solution, with a final concentration com- 
prising 3% FBS, 2% HEPES, 1.5% NaHCO», 
3% MgCl, and 1% Anti-Anti. Avicel overlay was 
removed after the incubation period, and cells 
were fixed with 20% formalin in PBS for 1 hour. 
Formalin was removed, and cells were washed 
with PBS. The fixed cells were stained with 
1% crystal violet for 15 min and washed. The 
plaque-forming units (PFU) were calculated 
as follows: number of plaques x dilution factor = 
number of PFU per 100 ul, which is then ex- 
pressed as PFU per milliliter. 


HRVI16 infection of 3D reconstructed human 
bronchial epithelium 


3D culture of bronchial epithelium was purchased 
from Mattek (AIR-1484 AIR-100 EpiAirway, 3D 
Respiratory Epithelial Human MicroTissues) 
and cultured using the Extended Culture pro- 
tocol as advised by the supplier. 


CRISPR-Cas9 knockout 


CRISPR-Cas9 editing performed in 293T cells 
was performed according to the method re- 
ported by the Doyon group (57), except that 
single guide RNAs (sgRNAs) were cloned into 
pSpCas9(BB)-2A-Puro (PX459) V2.0 (Addgene 
62988). Single clones of ouabain-resistant cells 
were selected for ZYG11B/ZER1 double knock- 
out. Clones 1 to 6 were transfected with ZYGIIB 
sg5 and 6. Clones 7 to 12 were transfected with 
ZYGIIB sg7 and 8. All clones received three 
guides for ZER1, sg5, 6, 8. 293T UBR2 KOs were 
cultured as polyclonal pools after ouabain selec- 
tion. N/TERT and NHBE KOs were performed 
using LentiCRISPR-V2 (Addgene 52961) and 
stable lentiviral transduction. The sgRNAs 
used are listed in Table 1. Knockout efficiency 
was tested with immunoblot or Sanger se- 
quencing of the targeted genomic DNA locus 
7 to 10 days after puromycin selection. 


RNA-seq of NHBEs 


Library preparation, quality control, and high- 
throughput sequencing were provided by 
Macrogen, Singapore. Total RNA isolated from 
NHBEs was processed using Library Kit TruSeq 
Stranded mRNA LT Sample Prep (Illumina) 
according the manufacturer’s protocol speci- 
fied in Library Protocol TruSeg Stranded mRNA 
Sample Preparation Guide, Part 15031047. Se- 
quencing was carried out with NovaSeq 6000 
with NovaSeq 6000 S4 Reagent Kit and Se- 
quencing Protocol NovaSeq 6000 System User 
Guide Document 1000000019358 v02 (Illumina). 
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Table 1. CRISPR sgRNA sequences used to generate knockout 293Ts, immortalized 


keratinocytes, and NHBEs. 


sgRNA used (gene and sgRNA number) 
NLRP1#1 


Sequence (5’ to 3’) 
GATAGCCCGAGTGACATCGG 


ZYGLIB#S 


Trimmed reads are mapped to reference ge- 
nome with HISAT2. After the read mapping, 
Stringtie was used for transcript assembly. 
Expression profile was calculated for each sam- 
ple and transcript/gene as read count and 
FPKM (fragment per kilobase of transcript per 
million mapped reads). 


Analysis of published RNA-seq datasets 


RNA-seq data for human AECs was extracted 
from the following published datasets: Nasal#1- 
GSE107898, Nasal#2-GSE55458, Bronchial#1- 
GSE107971, Bronchial#2-GSE107897, and Alveolar#1- 
GSE61220. In datasets where multiple treatment 
conditions were reported, only the basal/mock 
conditions were selected. RPM values Nasal#2- 
GSE55458 and Bronchial#1-GSE107971 were 
logged for heatmap generation. Color map- 
ping was performed using the “Double gra- 
dient” method in Graphpad Prism 8. 


Caspase-1 GLO and LDH release 


Caspase-1 GLO (Promega, #G9951) and LDH 
release assay (Promega CytoTox96, #G1780) 
were both carried out according to protocols 
provided by the manufacturers. 


Statistical analysis 


Statistical analyses were performed using 
Prism 8 (GraphPad). The methods for analy- 
sis are included in the figure legends. Error 
bars show mean values with SEM. 
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EVOLUTION 


Selection enhances protein evolvability by increasing 
mutational robustness and foldability 


Jia Zheng, Ning Guo, Andreas Wagner” 


INTRODUCTION: Natural selection plays a cen- 
tral role in adaptive evolution, but we still know 
little about its role in changing evolvability— 
the ability to bring forth new and adaptive 
phenotypes. Different kinds of selection may 
increase evolvability by different means. Weak 
purifying selection may enhance evolvability 
by promoting the accumulation of neutral or 
slightly deleterious mutations that can serve 
as stepping stones toward new phenotypes. 
By contrast, strong directional selection may 
enhance evolvability by favoring the accumu- 
lation of beneficial mutations that can enhance 
both fitness and evolvability, such as mutations 
that increase a protein’s thermodynamic stabil- 
ity or its robustness to mutations. 


RATIONALE: To find out how the strength of 
selection affects protein evolvability, we sub- 
jected populations of yellow fluorescent pro- 
teins to multiple rounds of directed evolution 


Robustness 
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in Escherichia coli. To control the strength of se- 
lection with precision, we used high-throughput 
phenotypic screening via fluorescence-activated 
cell sorting. During a first phase of our experi- 
ment (phase I), we subjected our populations 
to either strong selection, weak selection, or no 
selection on the ancestral phenotype of yellow 
fluorescence. During the second phase (phase 
II), we evolved all populations under the same 
selection pressure toward the new phenotype 
of green fluorescence. We subsequently used 
high-throughput phenotypic screening to 
study how phenotypes evolved in all our 
populations. In every generation, we also 
studied genotypic evolution with single-molecule 
real-time sequencing. We then engineered key 
adaptive mutants and determined their phe- 
notype and thermodynamic stability. In addi- 
tion, we determined the robustness of their 
phenotype to DNA mutations. Furthermore, 
we quantified the foldability of these mutants 
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Selection can drive evolvability. Evolutionary theory holds that Darwinian evolution takes place on adaptive 
landscapes of fitness, which can be visualized as topological maps of high-fitness peaks and low-fitness valleys. 
This hypothetical landscape illustrates how mutations can increase evolvability by enhancing both fitness and 
mutational robustness. Favored by strong selection because they enhance fitness, such mutations move an 
evolving population into a region of low curvature and high robustness (red arrow), from which the population can 
bypass rather than traverse (blue arrow) an adaptive valley on its way to an adaptive peak. 
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by unfolding them and observing their refold- 
ing kinetics. 


RESULTS: We found that populations under 
strong selection for the ancestral yellow fluores- 
cent phenotype during phase I subsequently 
evolved the new green fluorescent phenotype 
most rapidly during phase IJ. Compared to 
populations under weak or no selection, they 
reached higher green fluorescence during each 
generation of phase II and evolved a green 
emission peak more rapidly. Strong selection 
promoted both the elimination of deleterious 
mutations and the accumulation of foldability- 
improving mutations. As a result, proteins under 
strong selection evolved higher efficiency of 
protein folding (foldability) and, to an even 
greater extent, higher robustness to mutations 
than proteins under weak or no selection. Their 
robustness and foldability accelerated the selec- 
tive sweeps of neofunctionalizing mutations 
that are necessary to evolve a new phenotype. 
By contrast, proteins under weak selection 
accrued more deleterious mutations that slowed 
down the fixation of neofunctionalizing muta- 
tions during the evolution of the new phenotype, 
even though neofunctionalizing mutations 
had initially risen to higher frequencies under 
weak selection. 


CONCLUSION: Strong directional selection en- 
hances the evolvability of a new phenotype to 
a greater extent than weak purifying selection. 
The responsible mutations enhance tolerance 
to mutations, improve protein foldability, and 
thus increase accessibility of a protein’s na- 
tive state. In doing so, they promote the for- 
mation of correctly folded states that can 
display new functions after incorporating neo- 
functionalizing mutations. Although “first 
order” selection of fitness-enhancing muta- 
tions can be in conflict with “second-order” 
selection of evolvability-enhancing mutations, 
our experiments demonstrate a class of mu- 
tations that avoid this conflict, because the 
mutations they reveal enhance both fitness 
and evolvability. In the context of an adaptive 
landscape (see figure), they do so by circum- 
navigating rather than traversing adaptive 
valleys, passing through flat regions of such 
a landscape, and thus allowing an evolving 
population to climb a new adaptive peak more 
rapidly. More generally, our experiments prove 
that natural selection itself can create the con- 
ditions under which Darwinian evolution can 
succeed. 
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EVOLUTION 


Selection enhances protein evolvability by increasing 
mutational robustness and foldability 


Jia Zheng’*, Ning Guo*, Andreas Wagner’*** 


Natural selection can promote or hinder a population’s evolvability—the ability to evolve new and 
adaptive phenotypes—but the underlying mechanisms are poorly understood. To examine how the 
strength of selection affects evolvability, we subjected populations of yellow fluorescent protein to 
directed evolution under different selection regimes and then evolved them toward the new phenotype of 
green fluorescence. Populations under strong selection for the yellow phenotype evolved the green 
phenotype most rapidly. They did so by accumulating mutations that increase both robustness to 
mutations and foldability. Under weak selection, neofunctionalizing mutations rose to higher frequency 
at first, but more frequent deleterious mutations undermined their eventual success. Our experiments 
show how selection can enhance evolvability by enhancing robustness and create the conditions 


necessary for evolutionary success. 


atural selection drives adaptation, but we 

still know little about its role in chang- 

ing the evolvability of a trait or organism 

(1, 2). On the one hand, strong selection 

for an ancestral phenotype may enhance 
evolvability for derived phenotypes, because 
it may favor mutations that enhance not just 
fitness but also evolvability. The available 
evidence is limited and indirect (3-5). For 
example, a cytochrome P450 BM3 variant 
engineered for greater stability and fitness 
buffers the destabilizing effect of mutations 
that are neofunctionalizing, i.e., that convey 
new protein activities (5, 6). 

On the other hand, strong selection may 
impair evolvability because it purges weakly 
deleterious mutations that can convey new 
functions (7). Evolutionary theory holds that 
selection helps populations find peaks in 
adaptive landscapes of fitness, which can be 
visualized as topological maps of peaks and 
valleys. Weakly deleterious “stepping-stone” 
mutations may help a population traverse the 
valleys that separate different fitness peaks. 
Such valleys, which are caused by epistatic 
interactions between different mutations, are 
abundant in the adaptive landscapes of evolv- 
ing proteins (8-17). Weak selection that purges 
only the most detrimental mutations can aid 
such valley-crossing. Consistent with this view, 
when the enzyme ß-lactamase TEM-1 is sub- 
ject to “intense neutral drift” during experi- 
mental evolution—multiple rounds of mutation 
and selection to preserve its native phenotype 
of ampicillin resistance—the evolution of re- 
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sistance to the newly introduced antibiotic 
cefotaxime is accelerated (3, 12). More gener- 
ally, experimental evolution shows that a popu- 
lation evolves a derived phenotype faster if it 
harbors genetic variation in the loci that affect 
the ancestral phenotype (3, 13, 14). Such stand- 
ing genetic variation may even be adaptive if it 
has little effect on the ancestral phenotype 
(15). Therefore, conditionally neutral or weakly 
deleterious mutations can accelerate the adapt- 
ive evolution of a derived phenotype (15). Here 
we performed experiments aimed to find out 
whether strong or weak selection more ef- 
fectively enhances evolvability. 


Results 
Strong selection leads to greater evolvability 
than weak or no selection 


We evolved yellow fluorescent protein [YFP, a 
variant of a jellyfish fluorescent protein (J6)] 
in Escherichia coli from an ancestral pheno- 
type (yellow fluorescence) to a derived pheno- 
type (green fluorescence) (Fig. 1A). The protein 
was engineered to be well expressed in E. coli 
(16), but it is not native to E. coli, which mini- 
mizes interference with the native E. coli pro- 
teome. Studying evolvability in a single protein 
can help us analyze the causes of evolvability 
in molecular detail. In addition, the fluores- 
cent phenotype permits us to control the 
strength of selection with precision, because 
fluorescence-activated cell sorting (FACS) can 
screen individual cells for their phenotype. 
We subjected each of four replicate popula- 
tions of E. coli expressing yellow fluorescent 
proteins to four rounds (“generations”) of 
directed evolution. In phase I of our experiment 
(Fig. 1A), we selected for yellow fluorescence 
through either strong selection (populations 
S, the top ~20% of fluorescing cells survive) 
or weak selection (populations W, cells that 
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fluoresce above background survive) (17). We 
also subjected four replicate populations of 
yellow fluorescent proteins to four rounds of 
directed evolution without any selection for 
fluorescence (populations N, subject only 
to neutral drift). After phase I evolution, we 
initialized phase II evolution by subjecting the 
same populations to another four rounds of 
directed evolution under selection for green 
fluorescence (Fig. 1A) (77). We used mutagenic 
polymerase chain reaction (PCR) to generate 
0.84 amino acid-changing mutations per YFP 
molecule per generation (tables S1 to S3). 

During phase I evolution, yellow fluorescence 
in the unselected populations N dropped rap- 
idly and decreased to <5% of the ancestral YFP’s 
intensity, indicating an accumulation of dele- 
terious mutations (Fig. 1B). For populations 
under weak selection, W, the intensity of yellow 
fluorescence decreased after the first genera- 
tion of evolution but remained constant in the 
next three generations (Fig. 1B). This suggests a 
mutation-selection balance between the pro- 
duction of new deleterious mutations and selec- 
tion against such mutations. By contrast, yellow 
fluorescence in populations under strong selec- 
tion, S, significantly increased by 92.5% after 
four generations of evolution (one-sided t test, 
P <0.001; Fig. 1B), indicating a likely spread of 
beneficial mutations. 

We genotyped ~500 to 1000 protein var- 
lants per population and generation (table S4), 
which revealed that our evolving populations 
harbored different amounts of genetic varia- 
tion. Specifically, during phase I, populations 
W accumulated more amino acid-changing 
mutations and greater genetic diversity than 
populations S (figs. S1 and S2). Because greater 
genetic diversity may facilitate adaptive evo- 
lution, we hypothesized that populations W 
may have greater potential than populations 
S to evolve green fluorescence during phase 
II evolution. However, the opposite was the 
case. Populations S reached significantly higher 
green fluorescence than populations W during 
the first two generations of phase II evolution 
(one-sided Dunnett test with single-step adjust- 
ment, P < 0.05; Fig. 1C). In addition, popula- 
tions S evolved a green (512 nm) emission peak 
more rapidly than populations W (fig. S3). 
Analogous differences exist between popula- 
tions W and unselected populations N (Fig. 1C, 
and figs. S1 to S3). In sum, selection on an 
ancestral phenotype, and in particular strong 
selection, facilitates the evolution of a derived 


phenotype. 


Strong selection on an ancestral phenotype 
leads to the most rapid fixation of 
neofunctionalizing mutations 


To find out why strong selection causes greater 
evolvability than weak selection, we studied the 
dynamics of genetic polymorphisms in each 
replicate population during phase II (fig. S4). 
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Fig. 1. Experimental evolution of yellow fluorescent protein. (A) We subjected four replicate E. coli 
populations for each experimental treatment to directed evolution under selection for yellow fluorescence 
(phase |, Ae, = 488 nm and Aem = 530 + 15 nm). After four mutation-selection cycles, we continued 
directed evolution for four more cycles but under selection for green fluorescence (phase Il, Ae, = 405 nm 
and Aem = 525 + 25 nm) by selecting the top 0.01% of cells in each generation. The areas shaded in 
yellow or green indicate the proportion of a population allowed to survive to the next generation. 

(B and C) Fold-change of yellow (dashed lines) and green (solid lines) fluorescence intensity relative to 
ancestral YFP in each generation of phase | (B) and II (C). Error bars represent 1 SEM, from four replicate 
populations (single small symbols). *P < 0.05; **P < 0.01; ***P < 0.001 [one-sided Dunnett tests with 
single-step adjustment to compare S with W (blue) or N (black)]. 


Two mutations (G66S and Y204C) swept 
through each replicate S, W, and N popula- 
tion. Because of their ubiquity, we refer to 
these two mutations as universal mutations. 
In addition, another 20 mutations attained a 
frequency exceeding 30% in at least one re- 
plicate of populations S, W, and N. To determine 
whether all these (2 + 20) mutations are 
adaptive for green fluorescence, we engineered 
each of them into the ancestral YFP and 
measured their effects on green fluorescence. 
Only the two universal mutations G66S and 
Y204C caused green shifts of the emission 
peaks (Fig. 2A). Individually, these mutations 
caused a ~9-fold and ~2-fold increase in green 
fluorescence (fig. S5), and together they shifted 
the emission peak from yellow (530 nm) to green 
(512 nm; Fig. 2A). Thus, G66S and Y204C are 
the only neofunctionalizing mutations. 
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To further characterize the role of the re- 
maining mutations, we engineered each of 
these mutations into the background of 
G66S+Y204C, referred to as genotype U for 
“universal,” and measured the effect of these 
20 genotypes on green fluorescence. Nine 
mutants significantly enhanced green fluores- 
cence in the background of U (two-sided Dunnett 
test with single-step adjustment, P < 0.001; fig. 
S5), but none of these changed the emission 
spectrum (Fig. 2B). These mutations might 
increase the amount of soluble and functional 
fluorescent protein, which may help explain 
why green fluorescence also increased, albeit 
very modestly, during selection for yellow 
fluorescence in phase I (Fig. 1B). 

Because only two mutations are responsible 
for the green shift in phase II, we suspected 
that the rapid spread of these two mutations 
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resulted in faster adaptation of the strong 
selection (S) populations compared to the weak 
selection (W) and unselected (N) populations. 
Indeed, populations S displayed a higher fre- 
quency of the universal mutations G66S and 
Y204C, and of the universal genotype U, in at 
least last three of the four phase II generations 
(Fig. 20). 

Because the S, W, and N populations were 
subject to identical selection pressures during 
phase II, these faster sweeps likely originated 
in differences between populations at the end 
of phase I. We investigated if the reason was 
that the neofunctionalizing mutations had 
already attained a higher frequency in popula- 
tions S at the end of phase I. However, this was 
not the case. Both universal mutations had 
lower frequency in populations S at the end 
of phase I evolution, and one of them (Y204(C) 
had a significantly lower frequency in pop- 
ulations S than in populations W and N (one- 
sided Dunnett test with single-step adjustment, 
P < 0.001; fig. S6). 


Strong selection leads to greater 
mutational robustness and higher foldability 
than weak selection 


To resolve this apparent paradox—faster spread- 
ing of the neofunctionalizing mutations G66S 
and Y204C despite their lower initial frequency— 
it is relevant that our evolving sequences are 
likely to acquire one or more new mutations 
(with probability 0.53) in every generation 
(table S2). In addition, mutations that are dif- 
ferent from the two neofunctionalizing mutations 
are expected to arise by chance alone 412-fold 
more often than these two mutations (table 
S2). This means that most variants containing 
neofunctionalizing mutations will also accu- 
mulate many other mutations, most of which 
are near-neutral or deleterious (78, 19). Increased 
robustness to such slightly deleterious muta- 
tions would increase the fitness of genotypes 
carrying neofunctionalizing mutations and 
thus enable their spreading. We thus hypothe- 
sized that populations S had acquired genetic 
changes that cause greater robustness to dele- 
terious mutations. 

To validate this hypothesis, we mutagen- 
ized populations S, W, and N at the end of 
phase I and determined the residual fluo- 
rescence and the frequency of fluorescence- 
positive variants after mutagenesis. Populations 
S had indeed acquired greater mutational 
robustness, in both the ancestral yellow pheno- 
type and the derived green phenotype. Specif- 
ically, populations S retained significantly higher 
yellow fluorescence intensity than popula- 
tions W and N after mutagenesis (two-sided 
Dunnett test with single-step adjustment, P < 
0.05 and 0.01 for comparing S with W and 
N; Fig. 3A). In addition, the postmutagenesis 
frequency of yellow-fluorescence-positive 
variants in S populations was 1.24-fold and 
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Fig. 2. Most rapid fixation of neofunctionalizing mutations after strong selection on the ancestral 
phenotype. (A and B) Emission spectra at the new excitation wavelength (405 nm) of mutants introduced into 
ancestral YFP (A) and U (B). The vertical axes indicate the fold-change of green fluorescence intensity relative to 
ancestral YFP at a given emission wavelength (horizontal axes). Each curved line represents the emission 
spectrum of 1 of the 44 mutants (n = 3). Colored lines in (A) indicate those emission-shifted mutations, and 
colored lines in (B) indicate mutations that significantly improved green fluorescence in the genetic background U 
(two-sided Dunnett test with single-step adjustment, P < 0.05; fig. S5). Yellow and green vertical dashed lines 
indicate 530 nm (Ae, = 485 nm) and 512 nm (Aex = 405 nm). (C) Frequencies of neofunctionalizing mutations 
GoO6S and Y204C or the double-mutant U during evolution. Error bars represent 1 SEM from four replicate 
populations. *P < 0.05; **P < 0.01; ***P < 0.001 [one-sided Dunnett tests with single-step adjustment to 
compare S with W (blue) or N (black)]. Abbreviations for the amino acid residues are as follows: A, Ala; C, Cys; 
E, Glu; F, Phe; G, Gly; I, Ile; K, Lys; L, Leu; M, Met; S, Ser; V, Val; and Y, Tyr. 


13.0-fold higher than in W and N populations 
(Fig. 3B). Moreover, the postmutagenesis fre- 
quency of high-green variants, which have 
higher green fluorescence than the ances- 
tral YFP, was 3.8-fold and 326.4-fold higher 
in S populations than in W and N popula- 
tions, respectively (Fig. 3B). 

Because most deleterious mutations reduce 
protein solubility by causing protein misfold- 
ing and instability (20, 21), we suspected that 
populations S have evolved the ability to buffer 
such mutations by harboring protein variants 
with especially high foldability (folding effi- 
ciency) or stability (the free energy required 
to unfold a protein). Folding efficiency and 
stability can be jointly quantified by the con- 
centrations of soluble protein in vivo (22). We 
thus measured and compared the amount of 
soluble fluorescent proteins in populations 
S, W, and N at the end of phase I (17). The 
amount of soluble proteins in populations 
S relative to that of ancestral YFP is 1.9-fold 
and 3.8-fold higher than that in populations 
W and N (Fig. 3C). By contrast, the fraction of 
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insoluble protein in populations S is 2.8-fold 
and 2.5-fold lower than that in populations W 
and N (Fig. 3D and fig. S7), respectively. 

We also analyzed the role of foldability more 
directly by quantifying the refolding yield of 
fluorescent proteins after unfolding (77). In- 
deed, a higher percentage of unfolded fluores- 
cent proteins in populations S refolded than in 
populations W, N, and in ancestral YFP during 
24 hours of refolding (Fig. 3E and fig. S8A). 
Also, fewer than 50% of unfolded proteins re- 
folded correctly at 25°C within 24 hours (Fig. 
3E), demonstrating that solubility is likely 
limited by correct protein folding. In addition, 
the foldability of populations S did not de- 
crease after random mutagenesis, whereas it 
decreased in populations W and N (fig. S8B). 
This observation suggests that increased fold- 
ability also increases mutational robustness. 

Foldability cannot be completely disentangled 
from protein stability, because many mutations 
affect both (20, 23). Stability can be estimated by 
monitoring a protein’s structural integrity over 
time, and we measured for all our populations 
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the residual yellow fluorescence after 12 hours 
of incubation at 37°C, the temperature at which 
we had conducted our experiments. All pop- 
ulations retained more than 95% of yellow 
fluorescence (fig. S8, C and D). Populations S 
showed higher stability than populations 
W, N, and ancestral YFP only at higher, 
unphysiological temperatures above 65°C 
(fig. S8, E and F). In sum, our experiments sug- 
gest that the higher robustness of populations 
S is primarily caused by higher foldability. 


Foldability-improving mutations result 
in greater mutational robustness and 
higher evolvability 


We next aimed to identify the genetic changes 
that increased foldability and mutational 
robustness by examining our sequence data. 
We focused on the 2 neofunctionalizing and 
20 non-neofunctionalizing mutations that 
reached a frequency exceeding 30% at the 
end of phase II in populations S, W, and N 
(fig. S4:). The best candidates among them are 
four variants known to improve foldability 
(F47L, F65L, V164A, and 1172V) (23-25). All 
of these mutations and a fifth one (K102E) 
reached a higher frequency at the end of phase 
I in S populations relative to both W and N 
populations (fig. S9). 

When we unfolded these mutants and re- 
folded them (17), all five mutants yielded more 
correctly refolded protein than ancestral YFP 
(Fig. 4A), and three of them also refolded more 
rapidly than ancestral YFP (table S5). In ad- 
dition, the mutations cause increased protein 
solubility (figs. SIOA and S11). Notably, four of 
them also increased foldability (figs. S1OB and 
table S5), and two of them significantly in- 
creased solubility in the background U (one- 
sided Dunnett test with single-step adjustment, 
P < 0.05; figs. S10 C and S11), which folds with 
similarly low yield (~30%) as ancestral YFP (fig. 
S10D and table S5). The mutants also increased 
thermostability, albeit only at unphysiologically 
high temperatures (fig. S10, E to G, and table 
S6), indicating that they probably do not in- 
crease solubility by improving thermostability. 
In sum, all five key mutations improve fold- 
ability or protein solubility in the ancestral YFP 
background, and four of them do so in the 
green fluorescent protein background U. 

To estimate the mutational robustness of 
specific mutants, we used PCR to introduce 
random mutations into ancestral YFP and into 
each of the 22 high-frequency YFP variants. We 
then calculated the percentage of fluorescence 
retained relative to fluorescence before muta- 
genesis. After random mutagenesis, four out of 
five foldability-improving mutants retained greater 
yellow fluorescence (by 13.3 to 25.2%) than 
ancestral YFP (Fig. 4B), which indicates that 
they increased robustness. In one of the mutants 
(K102E), the increase in robustness was margin- 
ally significant (two-sided Dunnett test with 
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Fig. 3. Strong selection leads to greater mutational robustness and higher 
foldability than weak selection. (A) Yellow fluorescence retained by each 
population at the end of phase | after mutagenesis relative to its yellow 
fluorescence without mutagenesis. (B) Frequencies of cells fluorescing above 
background in yellow (left) (17) or above ancestral YFP (Anc) in green (right) (17) 
from each population at the end of phase | after mutagenesis. (C and D) Amount 


single-step adjustment, P = 0.064; Fig. 4B). 
After random mutagenesis, none of the other 
17 mutations significantly retained greater yel- 
low fluorescence intensity than ancestral YFP 
(two-sided Dunnett test with single-step ad- 
justment, P > 0.05; fig. S12). In addition, ran- 
dom mutations in all foldability-improving 
mutants created significantly more variants 
that remained yellow fluorescence-positive 
than ancestral YFP (two-sided Dunnett test 
with single-step adjustment, P < 0.05; Fig. 4C). 
Thus, foldability-improving mutations enhance 
the mutational robustness of the original yellow- 
fluorescence phenotype. 

In addition, the five foldability-improving 
mutants also enhanced the robustness of the 
derived green fluorescence phenotype. First, 
after mutagenesis, they retained significantly 
higher green fluorescence intensity (15.2 to 
35.3% higher) than randomly mutated ances- 
tral YFP (two-sided Dunnett test with single- 
step adjustment, P < 0.05; Fig. 4B). Second, 
random mutations in the foldability-improving 
mutants created significantly more variants 
that remained green fluorescence-positive 
(14.6 to 37.5% more) than ancestral YFP (two- 
sided Dunnett test with single-step adjustment, 
P < 0.01; Fig. 4C). In addition, three foldability- 
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improving mutations (F47L, F65L, and I172V) 
significantly improved green fluorescence (by 
more than 27.0%) when combined with the 
two neofunctionalizing mutations (two-sided 
Dunnett test with single-step adjustment, P < 
0.001; fig. S5). These results suggest that the 
foldability-enhancing mutations can accelerate 
a selective sweep of neofunctionalizing mutants 
during phase II. In support of this observation, 
the foldability-improving mutations achieved 
higher frequency in S populations than in W 
and N populations during each generation of 
evolution (fig. S13). Moreover, populations S 
retained a greater percentage of cells with 
higher green fluorescence than the ancestor 
during phase II evolution (Fig. 5A). 

The advantage of the foldability-enhancing 
mutations could be mainly caused by their ef- 
fect on robustness or by their effect on increas- 
ing fluorescence. We distinguished these two 
possibilities by measuring the effect of each 
mutation on fluorescence with or without muta- 
genesis. For example, the mutant F47L alone 
caused a 1.067-fold increase in green fluores- 
cence-positive cells relative to ancestral YFP. 
After random mutagenesis of the mutant F47L, 
the increase in green fluorescence-positive cells 
relative to the mutagenized ancestral YFP was 
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of soluble protein relative to ancestral YFP (C) and insoluble protein fraction (D) 
in each population at the end of phase |. (E) Recovery of yellow fluorescence over 
time (horizontal axis) during refolding of unfolded fluorescent proteins at 25°C 
(17). Error bars represent 1 SEM based on four replicate populations. *P < 0.05; 
**P < 0.01; ***P < 0.001 (two-sided Dunnett tests with single-step adjustment to 
compare S with W, N, or ancestral YFP). 


1.390-fold. Thus, mutational robustness was re- 
sponsible for most [84.2% = 100 x (1.390 - 
1.067)/(1.390 —1.0)] of the mutantť’s benefit for 
green fluorescence phenotype. This holds also 
for the other mutations, where we estimate that 
on average, >75% of the fluorescence benefit 
comes from increased robustness (Fig. 5B). 
Thus, foldability-improving mutations likely 
promoted the spreading of neofunctionalizing 
mutations by enhancing robustness. This is con- 
sistent with the observation that the foldability- 
improving mutations did not greatly increase 
specific green and yellow fluorescence on their 
own (table S7). In the genetic background U, 
three of these mutations (F4:7L, F65L, and I172V) 
also increased specific green fluorescence by 
more than 14% (table S7), and two of them 
(F47L and F65L) may have done so because of 
their spatial proximity to the chromophore (fig. 
S14). Such improvements in specific green flu- 
orescence might additionally help promote 
the fixation of U. 


Selection can eliminate deleterious 
mutations despite the presence of 
robustness-enhancing mutations 


Although robustness-enhancing mutations aug- 
ment the advantage of other beneficial mutations, 
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Fig. 5. Enhanced mutational robustness promotes sweeps of adaptive mutations. (A) Percentage of cells with higher green fluorescence than ancestral YFP in 
populations S, W, and N after mutagenesis during each generation of evolution (17). (B) Increased mutational robustness (purple) rather than fluorescence increase 
(orange) is the major contributor to the acceleration of green fluorescence evolution by foldability-increasing mutations. (C) Strong selection purges nonsense 
mutations most efficiently during phase | evolution. Error bars represent 1 SEM from four replicate populations [(A) and (C)] or 1 SD over three biological replicates 
(B). *P < 0.05; **P < 0.01; ***P < 0.001 [one-sided Dunnett tests with single-step adjustment to compare S with W (blue) or N (black)]. 


their interactions with deleterious mutations 
are more complex. If they completely mask 
the effects of a deleterious mutation, this mu- 
tation may be preserved. However, if the mu- 
tation remains somewhat deleterious even 
in the presence of robustness-enhancing mu- 
tations, it may still be eliminated by selection, 
and at a rate that depends on the strength of 
selection. To test whether selection in S pop- 
ulations was sufficiently strong to eliminate 
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deleterious mutations, we first used nonsense 
mutations, which produce truncated protein 
isoforms (26). Indeed, the frequency of non- 
sense mutations remained 2.1-fold lower in 
S than in W populations and 18.7-fold lower 
than in N populations at the end of phase I 
(Fig. 50). 

To complement this analysis, we also used 
the FoldX algorithm to predict destabilizing 
mutations (17). Such destabilizing mutations 
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too became depleted in S populations (figs. S15 
to S17). Specifically, highly destabilizing muta- 
tions had a consistently lower frequency in 
S populations than in W and N populations 
throughout phase I evolution (figs. S15A to 
S17). At the end of phase I, this frequency was 
2.4-fold and 9.7-fold lower in S populations 
than in W and N populations, respectively 
(figs. S15A). Also, the protein variants of popu- 
lations S were on average more stable than 
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Fig. 6. Mutational robustness helps increase genetic diversity and evolv- 
ability. (A) Increase in the average number of amino acid-changing mutations per 
fluorescent protein molecule relative to the beginning of phase II for evolving 
populations S, W, and N. (B) Increase in genetic diversity relative to the beginning of 
phase I| for evolving populations S, W, and N. The data show the increase in the 
number of amino acid-changing mutations (A) or in genetic diversity (B) in 
populations S, W, and N at the end of phase Il, relative to the end of phase l. Error 
bars represent 1 SEM from four replicate populations. *P < 0.05; **P < 0.01; ***P < 
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0.001 [one-sided Dunnett tests with single-step adjustment to compare S with W 
(blue) or N (black)]. (C) Robustness-enhancing mutations can help evolving 
populations bypass fitness valleys in an adaptive landscape. Robustness-enhancing 
mutations (short red arrow) can help move a population to a region of an 

adaptive landscape with low curvature, from where an adaptive peak can be reached 
more easily (long red arrow) than without robustness-enhancing mutations (blue 
arrow). Strong selection can help increase the frequency of robustness-enhancing 
mutations, and more so if such mutations also enhance fitness. 


those of populations W and N during phase I 
evolution (fig. SI5B). 

Because computational stability predictions 
may be inaccurate when a protein harbors 
multiple amino acid changes (27), we also 
experimentally measured the stability of fluo- 
rescent proteins expressed in our populations. 
In these experiments, proteins from S pop- 
ulations showed greater stability at 65° to 80° 
C than those from W and N populations (fig. 
S8E). This prevalence of more stable proteins 
in S populations also persisted after muta- 
genesis (fig. SSF). In sum, selection can purge 
strongly deleterious mutations even in the 
presence of robustness-enhancing mutations. 


Robustness-enhancing mutations help 
increase genetic diversity in populations 
subject to selection 


Though not completely preventing the purging 
of deleterious mutations, robustness-enhancing 
mutations can still increase genetic diversity 
within a population by helping a population 
tolerate some deleterious mutations. Spe- 
cifically, we observed that in both S and W 
(but not N) populations, proteins that carried 
robustness-enhancing mutations also harbored 
a significantly greater number of other amino 
acid changes at the end of phase I (one-sided 
t tests, P < 0.05; fig. S18). 

To further examine whether robustness- 
enhancing mutations help populations S tole- 
rate deleterious mutations during phase II 
evolution, we focused on the 22 mutations 
that reached a frequency exceeding 30% at 
the evolutionary end point. Among these muta- 
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tions, we identified four strongly deleterious 
mutations that significantly reduced the fluo- 
rescence of the universal green fluorescent 
genotype U by more than 20% (E18G, M79L, 
R110S, and N145S; two-sided Dunnett test 
with single-step adjustment, P < 0.001; fig. S5). 
Three of these deleterious mutations occurred 
in populations S, whereas only one of them oc- 
curred in any other population. Notably, more 
than 85% of the genetic variants that harbored 
one of these mutations also harbored at least 
one of the five foldability-improving mutations 
at the evolutionary end point (fig. S19). This 
suggests that robustness-enhancing mutations 
helped populations S tolerate strongly delete- 
rious mutations. It is also consistent with the 
observation that populations S experienced 
greater increases in both the number of ami- 
no acid changes and genetic diversity than 
populations W and N during phase II evolution 
(Fig. 6, A and B). 


Discussion 


Neutral or weakly deleterious mutations can 
play an important role in adaptive evolution 
(13, 28), because they can convey new functions. 
They also help populations respond rapidly to 
environmental changes (3, 7, 12, 13) and traverse 
fitness valleys created by epistatic (nonadditive) 
mutational interactions (29). Weak purifying 
selection facilitates the accumulation of such 
mutations (7). If they were central to the evolu- 
tion of a derived green fluorescence phenotype, 
our weak selection regime should have resulted 
in higher evolvability of this phenotype. How- 
ever, we found that strong selection led to 
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more rapid evolution of the green fluorescence 
phenotype (Fig. 1B). 

To understand why, consider that many 
proteins are marginally foldable (30) and mar- 
ginally stable (37). Also, most mutations ac- 
cumulated during evolution will further reduce 
protein foldability and stability (20, 32), es- 
pecially under weak selection (12, 33, 34). To 
function well, most proteins must first fold 
correctly (high foldability) and maintain struc- 
tural integrity after folding (high stability) 
(35), resulting in selection on the evolution of 
foldability and stability. As a result, strong selec- 
tion not only can stop the erosion of foldability 
and stability by purging deleterious mutations, it 
also favors mutations that enhance both proper- 
ties. This ability is especially important during 
the evolution of new phenotypes, because neo- 
functionalizing mutations usually destabilize 
proteins and reduce folding efficiency (5, 34, 36). 
In our experiments, strong selection favored 
mutations that increase foldability and, to an 
even greater extent, mutational robustness. In 
doing so, selection enhanced the penetrance 
of beneficial mutations (Fig. 5A) and accel- 
erated selective sweeps (Fig. 2C). In addition, 
foldability-improving mutations may also pro- 
mote the fixation of neofunctionalizing muta- 
tions by improving their specific protein 
activity during adaptive evolution (table S). 
We anticipate that our observations apply to 
the evolution of most proteins in which fold- 
ability and stability are important. 

Our experiments required high mutation 
rates so that we could observe adaptive evolu- 
tion on a laboratory time scale. Comparable 
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mutation rates have been observed in viruses 
(37, 38) and microbes, especially those that are 
challenged to evolve rapidly by environmental 
stressors (39-42). Such high mutation rates 
are also required for the evolution of robust- 
ness as a direct response to mutation pressure 
(3, 4, 43). However, because our evolvability- 
enhancing mutations increase both robust- 
ness and foldability (Fig. 6C), their direct 
fitness benefit (fig. S20) would help them 
accumulate also under much lower mutation 
rates. An important question is whether the 
advantage of strong selection for evolvability 
also persists at low mutation rates. 

Our work highlights the important role of 
standing variation for evolvability (13, 14, 20, 44-46). 
It shows that natural selection can play a crucial 
and active role in creating standing variation 
that is both beneficial and enhances evolvability— 
for example, by increasing robustness to dele- 
terious mutations. This contrasts with some 
theoretical and experimental work, in which 
first-order selection for fitness conflicts with 
second-order selection for robustness (47, 48). 
We predict that evolution can avoid this con- 
flict when mutations with both fitness-enhancing 
and evolvability-enhancing roles exist. 

Our work also suggests experimental de- 
signs that select for beneficial mutations without 
depleting genetic variation before selecting for 
a new phenotype. Such experiments could be 
further enhanced by starting with protein 
variants engineered for high robustness. Fur- 
thermore, our observations extend beyond 
bioengineering. We expect that the evolution- 
ary rescue of populations after environmental 
challenges, like climate change, may be easier 
in cases when a population’s evolutionary po- 
tential has been previously enhanced by strong 
selection. Most generally, our observations sug- 
gest that natural selection can create favorable 
conditions for Darwinian evolution. 


Materials and methods summary 
Plasmids and strains 


We used the plasmid vector pBAD202/D-TOPO 
(K4202-01, Invitrogen) for cloning and ex- 
pressing YFP alleles. This vector contains a 
kanamycin resistance marker and an arabinose- 
inducible araBAD promoter. We used E. coli 
strain BW27783 (CGSC 12119) as a host to enable 
the homogeneous expression of the arabinose- 
inducible araBAD promoter (49). 


Creating mutant libraries 


We used the same mutagenesis protocol during 
phase I and phase II of our experiments (/7). 
Specifically, we performed mutagenic PCR 
(using the primers MutafpF and MutafpR, 
table S8) to randomly introduce mutations 
into the coding region of YFP, inserted the re- 
sulting mutant pool into the vector backbone 
by ligation, and then electroporated the liga- 
tion product into electrocompetent cells (77). 
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After electroporation, we immediately added 
1 ml of prewarmed SOC medium, incubated the 
recovering cell culture for 1.5 hours at 37°C with 
shaking at 220 rpm in a 50-ml tube, and used 
the recovered culture for further experiments. 

To avoid the accumulation of mutations in 
the plasmid outside the YFP coding region, we 
inserted randomly mutated YFP genes into a 
fresh plasmid backbone in each generation. In 
addition, to avoid mutations that might ac- 
cumulate in the E. coli genome, we transformed 
mutated YFP gene pools into fresh E. coli com- 
petent cells in each generation. 


Directed evolution 


We induced fluorescent protein expression in 
evolving populations at 37°C for 12 hours (with 
shaking at 220 rpm) by using 0.2% arabinose 
(17). We used an Aria III cell sorter (BD 
Biosciences) to sort cells at 4°C in the fluores- 
cein isothiocyanate (FITC) channel (A,, = 
488 nm, Aem = 530 + 15 nm; phase I evolution) 
or in the AmCyan channel (A,, = 405 nm, 
Nem = 525 + 25 nm; phase II evolution) accord- 
ing to the selection criteria described in Fig. 1. 
In phase I evolution, we collected ~10° selected 
cells in ~1 ml of cold phosphate-buffered saline 
(PBS) buffer at 4°C for each replicate popula- 
tion (17). In phase II evolution, we first selected 
~5 x 10° cells with the top 1% of green 
fluorescence intensity, regrew the sorted cells, 
and repeated the sorting process by again select- 
ing ~10* cells in the top 1% of green fluorescence 
intensity (17). In each round, we regrew sorted 
cells, and isolated plasmids from sorted cells and 
used them as templates for the next mutation- 
selection cycle and for single-molecule real-time 
(SMRT) sequencing (17). To prevent cell prolif- 
eration or death, we placed selected cells on ice 
before the subsequent steps. 


Engineering YFP variants 


We used whole-plasmid PCR to engineer single 
mutants and some double mutants by design- 
ing primers that carry the corresponding muta- 
tions (table S9) (77). We also used whole-plasmid. 
PCR to engineer single mutations into the 
genetic backgrounds of G66S, Y204C, or U 
(G66T + Y204C) by using the mutants G66S, 
Y204C, or U as templates (17). We used Gibson 
Assembly Master Mix (E2611, NEB) to engineer 
the double-mutant U (G66T+Y204C) by using 
the primers G66Sf/Y204Cr and Y204Cr/G66Sf 
(table S9) (17). 


Fluorescence assay using flow cytometry 


We grew evolving populations or engineered 
variants in 200 ul or 2 ml of LB with 0.2% 
arabinose to induce the expression of YFP 
variants in evolving populations or engineered 
YFP variants (17). We added 40 ul of a culture 
to 160 ul of cold PBS buffer, transferred 20 ul of 
the resulting suspension to 180 ul of cold PBS 
buffer, and mixed the solution thoroughly. We 
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used the resulting mixture to measure yel- 
low fluorescence (Aex = 488 nm and Agm = 
530 + 15 nm) and green fluorescence (ex = 
405 nm and Agm = 525 + 25 nm) using flow 
cytometry. We performed fluorescence as- 
says at room temperature with a flow rate of 
~3000 events/s by using a Fortessa cell ana- 
lyzer (BD Biosciences). We performed at least 
three biological replicates for each replicate 
population or each variant and analyzed ~10* 
events per replicate. To prevent cell prolifer- 
ation or death, we placed all samples on ice 
until we had finished all assays. 


Flow cytometry data analysis 


We performed flow cytometry data analysis by 
using FlowJo V10.4.2 (LLC). Briefly, we used 
forward scatter height (FSC-H) versus side 
scatter height (SSC-H) density plots to select 
a homogeneous cell population (p1 in fig. S21A), 
and used side scatter area (SSC-A) versus SSC-H 
density plots to exclude doublets (p2 in fig. 
S21B). We used the resulting filtered data p2 
for determining the fluorescence intensity of 
evolving populations and for determining mu- 
tational robustness of evolving populations 
and those single mutants (Figs. 1, 3, 4, and 
5 and fig. S12). We used FITC-height versus 
AmCyan-height density plots to select the domi- 
nant cell population (p3 in fig. S21C) and used 
the resulting filtered data p3 for determining 
the fluorescence intensity of each mutant (figs. 
S5 and S20). 


Protein solubility determination and refolding 
kinetics measurements 


We induced the expression of YFP variants in 
evolving populations or engineered mutants 
by growing cells in 2 ml of LB medium with 
50 ug/ml of kanamycin and 0.2% arabinose 
in a 10-ml tube at 37°C and at 220 rpm for 
12 hours (17). We used CelLytic B Cell Lysis 
Reagent (B7435-500ml, Sigma) to extract both 
soluble and insoluble proteins from the col- 
lected cells by following the manufacturer’s 
protocol (17). We quantified the amount of 
both soluble and insoluble proteins in each 
sample by SDS-polyacrylamide gel electro- 
phoresis (SDS-PAGE) (17). 

To unfold proteins, we diluted 5 ul of crude 
lysate with 45 ul of 9 M urea (containing 
10 mM dithiothreitol) and incubated the solu- 
tion at 95°C for 5 min. To refold unfolded pro- 
teins, we diluted 10 ul of the unfolded samples 
with 180 ul of TNG buffer in a 96-well microplate 
and used an infinite F200 Pro (A,, = 485 nm, 
Nem= 530 nm) or a Spark 10 M (A,, = 485 nm, 
Nem = 530 nm and A,, = 405 nm, Aem = 512 nm) 
microplate reader to measure fluorescence 
intensity at ~20-min intervals (17). 


SMRT sequencing and sequence data analysis 


We barcoded YFP variants of each replicate 
population by using PCR and ligation for SMRT 
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sequencing and used the Pacific Biosciences 
RS2 instrument (Pacific Biosciences) to per- 
form sequencing (17). We used the SMRTA- 
nalysis v2.3 package (50) to perform primary 
data analysis (17). We wrote Python scripts 
(Python 2.7.12) to identify point mutations 
and their combinations and to calculate genetic 
diversity in each replicate population from each 
generation of evolution (17). 


Statistical analysis 


Unless specified otherwise, we conducted pair- 
wise comparisons by using a one-tailed ¢ test 
and conducted multiple comparisons to a con- 
trol by using a one-sided Dunnett test with 
single-step adjustment. We performed all 
Statistical analysis using R version 3.4.1. 
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INTRODUCTION: Discrimination of self from 
nonself is pivotal for cellular organisms, as it 
allows the perception of pathogenic invaders 
that might otherwise multiply unchecked and 
cause disease in the host. To recognize nonself 
and to repel intruders, multicellular organ- 
isms deploy complex immune systems, in 
which repertoires of dedicated immune re- 
ceptors play a central role. Innate immunity 
is an evolutionarily ancient arm of immunity 
in plants and animals that relies on struc- 
turally related, germline-encoded receptors. 
One class of these immune receptors inside 
cells, called the NLR protein family, shares a 
nucleotide-binding domain and leucine-rich 
repeats (LRRs). Plant sensor NLRs are clas- 
sified into two main groups that are defined 
by different N-terminal domains: a coiled- 
coil (CC) domain in CC-NLRs (CNLs) and a 
Toll-interleukin-1 receptor (TIR) domain in 
TIR-NLRs (TNLs). A deeper understanding 
of the principles that govern nonself recog- 
nition by NLRs and their activation of in- 
nate immune responses necessitates protein 
structure-based approaches and reconstitu- 
tion of signaling-active receptor complexes. 


RATIONALE: Host-adapted plant pathogens se- 
crete numerous effectors into the host extra- 
cellular spaces or inside cells. These effectors 
promote virulence, often by interfering with 
defense responses. Plant NLRs typically de- 
tect strain-specific pathogen virulence factors 


(effectors) delivered into host cells. This trig- 
gers immune responses that curtail pathogen 
proliferation and often culminate in localized 
host cell death. During plant host-pathogen 
coevolution, positive selection of random 
mutations in effector genes that abrogate 
NLR recognition drives the diversification 
of NLR repertoires at the population level. 
A well-studied coevolved pathosystem involves 
Arabidopsis thaliana and the downy mildew 
pathogen Hyaloperonospora arabidopsidis 
(Hpa). The A. thaliana TNL receptor RPP1 
confers strain-specific immunity through 
recognition of Hpa effector ATRI. Specific 
allelic variants of ATR1 in Hpa populations 
activate only certain RPP1 variants in par- 
ticular A. thaliana accessions. Previous work 
detected a nicotinamide adenine dinucleotide 
(NAD")-consuming enzymatic activity medi- 
ated by the N-terminal TIR domains of TNLs. 
How the TIR-associated NAD* hydrolase (NADase) 
activity and downstream signaling is enabled 
by TNL effector recognition is unknown. 


RESULTS: We coexpressed a naturally occurring 
A. thaliana RPP1 receptor variant with its 
matching Hpa effector ATRI in insect cells. 
Protein purification revealed an oligomeric 
protein complex of ~600 kD consisting of 
RPP1 and ATRI, which we term the “RPPI1 
resistosome.” Biochemical assays showed that 
the RPP1 resistosome displays much higher 
Meg”*/Ca?*-dependent NADase activity than 
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formed by asymmetric 
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active center 





TIR holoenzyme Plant cell death 


Pathogen activation of an NLR holoenzyme. Recognition of pathogen effector ATR1 via the C-JID and LRR 
domains of the plant TNL receptor RPPI triggers the assembly of a tetrameric receptor complex with two 
asymmetric N-terminal TIR domain homodimers. This tetramer-induced TIR asymmetry creates, via two centrally 
located BB-loops, active sites for NAD” hydrolysis, which is essential for RPP1 signaling leading to host cell death. 
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RPP1 alone. Using cryo-electron microscopy, 
we resolved a structure of the oligomeric com- 
plex that contains four RPP1 and four ATR1 
molecules and reveals a tetrameric assembly 
mediated entirely by RPP1 subdomains. In con- 
trast to other adenosine triphosphate (ATP)- 
bound NLRs in their active forms, RPP1 in the 
resistosome is adenosine diphosphate-bound, 
probably because of the lack of a motif required 
for ATP binding. The structure also reveals 
direct binding of ATR1 to a C-terminal jelly roll/ 
Ig-like domain (C-JID) and the LRRs of the 
RPP1 receptor. Protein sequences correspond- 
ing to contact regions between the receptor and 
the pathogen effector are polymorphic in natu- 
rally occurring RPP1 and ATRI variants, ex- 
plaining why only certain RPP1 variants can 
detect strain-specific ATR1 molecules. The 
sequence-diversified RPP1 is shared by 
many other TNLs, but not CNLs, in diverse 
plant species and might serve a role in the 
detection of other pathogen effectors. Receptor 
tetramerization creates two potential NADase 
active sites, each formed by an asymmetric TIR 
homodimer. Structure-guided substitutions of 
residues at this homodimeric TIR interface 
abolished ATR1-induced cell death in planta, 
supporting an essential role of the TIR-TIR 
interface in RPP1 function. Our combined bio- 
chemical and in planta assays show that as- 
sembly of two asymmetric TIR homodimers 
by the tetrameric receptor complex is respon- 
sible for NAD” hydrolysis and RPP1-mediated 
immune signaling. 


CONCLUSION: Our findings indicate that the 
RPP1 resistosome acts as a pathogen effector- 
inducible holoenzyme for NAD” hydrolysis. The 
tetrameric RPP1 oligomeric structure provides 
an example of direct pathogen effector recog- 
nition by a plant NLR receptor and uncovers the 
mechanism of strain-specific recognition leading 
to NLR conformational activation. Our work 
suggests a multilayered regulation of RPP1 
tetramerization, including ATR1 binding, RPP1 
oligomerization driven by interactions among 
nucleotide-binding domains, and RPP1""* self- 
association. The analysis provides a structural 
insight to induced NAD” hydrolysis mediated 
by the RPP1 holoenzyme and a framework for 
TNL receptor signaling. As a holoenzyme, the 
RPP1 resistosome bears similarity to the 
animal apoptosome and inflammasome, which 
form holoenzymes after the recruitment of 
procaspases. 
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Direct or indirect recognition of pathogen-derived effectors by plant nucleotide-binding leucine-rich 
repeat (LRR) receptors (NLRs) initiates innate immune responses. The Hyaloperonospora arabidopsidis 
effector ATR1 activates the N-terminal Toll-interleukin-1 receptor (TIR) domain of Arabidopsis NLR RPP1. 
We report a cryo-electron microscopy structure of RPP1 bound by ATR1. The structure reveals a 
C-terminal jelly roll/lg-like domain (C-JID) for specific ATR1 recognition. Biochemical and functional 
analyses show that ATR1 binds to the C-JID and the LRRs to induce an RPP1 tetrameric assembly 
required for nicotinamide adenine dinucleotide hydrolase (NADase) activity. RPP1 tetramerization 
creates two potential active sites, each formed by an asymmetric TIR homodimer. Our data define 
the mechanism of direct effector recognition by a plant NLR leading to formation of a signaling- 


active holoenzyme. 


ntracellular nucleotide-binding leucine- 

rich repeat immune receptors (NLRs) 

have evolved independently in plants 

and animals to detect pathogen distur- 

bance. Plant sensor NLRs are classified 
into two main groups that are defined by 
different N-terminal domains: a coiled-coil 
(CC) domain in CC-NLRs (CNLs) and a Toll- 
interleukin-1 receptor (TIR) domain in TIR- 
NLRs (TNLs). Direct or indirect recognition 
of pathogen effector proteins by plant NLRs 
triggers an immune response termed effector- 
triggered immunity (1-6), often characterized 
by rapid host cell death (a hypersensitive re- 
sponse) at sites of attempted infection. In 
addition to the C-terminal LRR domain, non- 
canonical integrated domains of plant NLRs 
play a critical role in conferring specific ef- 
fector recognition (7). Although modes of ef- 
fector recognition vary, ligand sensing is 
widely believed to induce oligomerization 
of NLRs for signaling. For example, a re- 
cent structural study of the CNL ZARI in 
Arabidopsis showed that the ZARI resisto- 
some induced by bacterial effector AvrAC 
assumes a wheel-like structure similar to that 
of NLR inflammasomes in animals (8). A body 
of evidence suggests that the CC domains of 
CNLs and the TIR domains of TNLs mediate 
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signaling upon NLR activation (9-14). A TIR 
domain signaling role is further supported 
by biochemical assays that detected NADase 
activity required for TNL-mediated immunity 
(15, 16). Structural and functional studies 
with individual TIR domains revealed their 
capacity for self-association as homo- or 
heterodimers, which is important for immu- 
nity induction (5, 9, 17, 18). How TIR domain 
signaling activity is enabled by TNL effector 
recognition in the context of full-length re- 
ceptors is unknown. 

Members of the Arabidopsis RPP1 (Recog- 
nition of Peronospora parasitica 1) TNL family 
specifically recognize cognate ATRI (Arabidopsis 
thaliana Recognized 1) effector variants 
produced by the foliar oomycete pathogen 
Hyaloperonospora arabidopsidis (Hpa) (19, 20). 
In host and pathogen populations, both RPPI 
from Arabidopsis accessions and ATRI from 
Hopa strains are highly polymorphic. Rec- 
ognition of different ATR1 forms by RPP1 
variants is Hpa race-specific, indicative of 
host-pathogen coevolution (12, 21). Specific 
RPP1-ATRI recognition in Arabidopsis lead- 
ing to leaf macroscopic host cell death was 
recapitulated in tobacco transient gene ex- 
pression assays. Coupled with biochemical 
data, these studies showed a requirement for 
the RPP1 C-terminal LRR domain in direct 
binding of recognized ATRI forms (20, 22). 


Reconstitution and cryo-EM structure of the 
RPP1 resistosome 


We coexpressed Strep-RPP1_WsB (residues 61 
to 1221) with His-ATR1_Emoy2 (residues 52 to 
311) as a matching TNL-effector pair (72, 27) in 
insect cells. We used a tandem affinity puri- 
fication procedure to isolate the complex. Gel 
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filtration analysis showed that comigration of 
the RPP1 and ATRI proteins corresponded to a 
molecular weight of ~600 kD (fig. S1), indicat- 
ing that they form an oligomeric complex, 
which we term the “RPP1 resistosome.” The 
complex obtained from gel filtration was used 
for structural analysis by cryo-electron micros- 
copy (cryo-EM). After three-dimensional (3D) 
classification, a subset of 409,348 particles was 
used for image reconstruction, generating a 
map with a global resolution of 3.16 A, as 
determined with a gold-standard Fourier shell 
correlation (Fig. 1A and fig. S2). Resolution of 
the core complex without inclusion of the LRR 
portion was 2.99 A. 

The cryo-EM structure of the RPP1 resisto- 
some contains four RPP1 and four ATR1 mole- 
cules, which assemble into a tetrameric complex 
measuring 160 A x 160 A x 120 A (Fig. 1B, fig. S3, 
A and B, and table S1). Tetramerization of the 
resistosome is mediated entirely by RPP1 sub- 
domains. We discovered in the resistosome 
structure that RPP1 has a C-terminal domain 
in addition to the canonical TNL domains 
TIR, nucleotide-binding domain (NBD), helix 
domain 1 (HD1), winged helix domain (WHD), 
and LRRs (Fig. 1B and fig. S3, C to F). Struc- 
tural searches of the Protein Data Bank (PDB) 
revealed that the C-terminal domain adopts a 
classical B-jelly roll and Ig-like fold (fig. S4A), 
which we designate C-JID (C-terminal jelly roll 
and Ig-like domain). The RPP1°~"” structure is 
similar to that of the TNL Roq] (23) (fig. S4B). 
Whereas all canonical TNL-type domains of 
RPP1 are involved in resistosome assembly, 
RPP1°%!” mediates interaction with ATR], 
assisted by the inner surface of RPP1'** (Fig. 
1B). The RPP1 resistosome is organized into 
a three-layered ring structure, with the top, 
middle, and bottom formed by the TIR do- 
main, the NOD (nucleotide-binding oligomer- 
ization domain) module (1.e., NBD-HD1-WHD 
domains), and ATR1-bound LRR-C-JID, re- 
spectively (Fig. 1B). RPP1N®”, RPP1!#?! and 
RPP1”"” are positioned similarly to the cor- 
responding domains of activated ZARI, 
NLRC4, and Apaf-1 (fig. S4C), indicating that 
RPP1 adopts an active conformation in the 
resistosome. 


RPP1°~"" is an essential structural 
determinant for ATR1-specific recognition 


RPP1-bound ATR1 is nearly identical to a crys- 
tal structure of the ATR1 monomer alone (24) 
(fig. S4D). RPP1'®® and RPP1°""” are com- 
paratively less well defined than other RPP1 
domains (fig. S2C). The cryo-EM density of 
RPP1-R® and RPP1°~"” was substantially im- 
proved by local refinement, sufficient for model 
building (figs. S2C and S3, G to I). ATRI 
obliquely contacts RPP1©?” and RPP1"®® via 
its N-terminal segment (residues 67 to 190) 
(Fig. 2A), consistent with an earlier report 
that residues 68 to 222 of ATRI are sufficient 
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Fig. 1. Tetrameric assembly of the RPP1 resistosome. (A) Final 3D reconstruction of the RPP1 resistosome 
(oligomeric RPP1-ATR1 complex) in three orientations. (B) Final model of the RPP1 resistosome in three 
orientations. The reconstruction and model in each row are shown in the same orientation. Color codes for 


ATRI and subdomains of RPPI are indicated. 


for RPP1 recognition (24). ATR1 interacts mainly 
with a flat exposed anti-B sheet surface of 
RPP1 (interface 1), establishing hydro- 
gen-bonding and hydrophobic interactions 
(Fig. 2B and fig. S5A). Contacts with non- 
conserved residues from the inner surface of 
the curved RPP1"** further contribute to ATRI 
interaction with RPP1 (interface 2) (Fig. 2C and 
figs. S5B and S6). 

To verify the cryo-EM structure, we made 
substitutions of residues in ATRI and RPPI1 
from interface 1 or 2, or both together. The 
various mutants were coexpressed in insect 
cells. Asp’*° of ATR1_Emoy2 (ATR1?™°) that is 
conserved in RPP1_WsB-recognized ATRI1_ 
Maks9 and ATR1_Emco5, but not in non- 
recognized ATR1_Cala2 and ATRI_Emwal 
(fig. S7), is located at the center of interface 
1 (Fig. 2B). Substitution of this residue with a 
tyrosine present at the equivalent position of 
ATRI_Cala2 and ATR1_Emwal substantially 
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reduced ATR1-Emoy2 binding to RPP1 in vitro 
(fig. S8A), indicating that interaction with 
RPP1°""” is important for ATR1 recognition. 
This result also explains why the ATR1_Cala2 
and ATR1_Emwal alleles are not recognized 
by RPP1_WsB (12, 21). In further support of 
the cryo-EM structure, alanine substitutions 
of five residues from the loop region (E117A/ 
L122A/D124A/T125A/Y126A) of ATR1 at inter- 
face 2 resulted in loss of interaction with RPP1 
(fig. S8A). Substitutions of RPP1 residues at 
either of the two interfaces with amino acids 
at equivalent positions in other RPP1 variants 
(fig. S6) impaired interaction with ATR1_ 
Emoy2 (fig. S8A). 

We tested whether the above RPP1 and 
ATRI1 substitutions affected ATR1-induced 
RPP1-dependent host cell death by using 
Agrobacterium tumefaciens-mediated tran- 
sient gene expression to coexpress untagged 
RPP1 and hemagglutinin (HA)/StrepII (HS)- 
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tagged ATRI forms in leaves of Nicotiana 
benthamiana. Expectedly, coexpression of 
wild-type (WT) RPP1 with WT ATRI-HS re- 
sulted in cell death of infiltrated leaf zones 
(Fig. 2D). Epitope-tagged WT RPPI-HS also 
produced ATR1-dependent cell death in 
N. benthamiana and was detectable as a 
~140-kD protein by immunoblotting (fig. S8, 
B and C). By contrast, cell death was strongly 
impaired with ATR1 proteins bearing sub- 
stitutions that abolished or reduced inter- 
action with RPPI (Fig. 2, C and D, and fig. 
S8D). The loss of cell death induction by 
ATRIEN’/ L122A/D124A/T125A/Y126A with multi- 
ple substitutions at interface 2 (Fig. 2, C and 
D) showed that in addition to RPP1°”””, 
RPP1'** is important for ATR1 recognition, 
in agreement with previous data (20, 22). 
Coexpression of RPP1 with ATRIR17“4/"1794 
that had no detectable interaction with RPP1 
in vitro (fig. S8A) caused WT-like cell death 
(Fig. 2, C and D). The reason for the dis- 
crepancy remains unclear, but it is possible 
that a weak interaction between this ATRI 
mutant and RPP1 was undetectable in vitro 
but was sufficient to support cell death. The 
RPP1 substitutions RPPS 35/S025Y/N1081R/S183R 
from interface 1 retained WT-like cell death ac- 
tivity when coexpressed with ATRI, but cell death 
was abolished with RPP1%80°P/88958/Y935E/R937™M 
substitutions from interface 2 (Fig. 2E). Col- 
lectively, our data indicate that RPPI™™® and 
RPPi'®* are structural determinants for 
ATRI-specific binding and recognition of nat- 
urally occurring ATRI variants. In support of 
this conclusion, a structure-guided RPP1 se- 
quence alignment revealed that residues from 
these two domains, in particular those from 
RPP1°""”, are variable between different RPP1 
proteins (fig. S6). 


Assembly of the RPP1 resistosome is required 
for NADase activity 


As observed in oligomerization of other NLR 
proteins (8, 25-27), the RPP1 central NOD 
module (NBD-HDI-WHD domains) partici- 
pates in tetramerization (Fig. 3A and fig. S9, 
A and B). In contrast to other NLRs, however, 
the loop region between 82 and a2 of RPP1\”” 
mediates RPP1 tetramerization by interacting 
with a groove between RPP1N”” and RPP1™"” 
of the adjacent protomer around the P-loop 
region (Fig. 3B and fig. S9C). Besides self-as- 
sociation (discussed below), the RPP1""* stacks 
against RPP1‘®” from an adjacent protomer 
(Fig. 3C and fig. S9, A, D, and E). Two adjacent 
RPP1''8s are positioned differently to engage 
in distinct interactions with RPP1N®” (Fig. 3A). 
A similar observation was reported for N- 
terminal CARD domains in the CED-4 apopto- 
some (28). In the RPP1 tetramer, the TIR loop 
region N-terminal to RPP1‘®” (TIRa) is better 
defined than that of its neighboring TIR 
(TIRb). RPP1'!** packs tightly against the top 
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Fig. 2. Structural mechanism of ATR1 recogni- 
tion by RPP1. (A) Structure of ATR1 (in cartoon) 
bound by the RPP1R®-°~ fragment (in surface) 
showing two interfaces with ATRI1. (B) Detailed 
interactions between RPP1°""? and ATRI at 
interface 1 [blue frame in (A)]. Red dashed lines 
indicate polar interactions. (C) Detailed interactions 
between RPP1®® and ATRI at interface 2 [purple 
frame in (A)]. Red dashed lines indicate polar 
interactions. (D) Host cell death triggered by 
coexpression of untagged RPP1_WsB and HS-tagged 
ATR1_Emoy2 variants in leaves of N. benthamiana. 
WT, wild type. Cell death was quantified by a leaf 
disk ion leakage (conductivity) assay at 3 days after 
agro-infiltration (dai). Data are normalized to the 
mean value for samples with RPP1_WsB and WT 
ATR1_Emoy2 in each experiment. Results from three 
independent experiments are displayed on the plot 
(n = 18; Tukey's HSD test, a = 0.01; data points of 
the same color were recorded in one experiment; 
Shared lowercase letters indicate no significant 
difference). (E) Top: lon leakage assay of RPP1_WsB 
mutations at RPP1-ATR1 interfaces 1 and 2 on 
ATR1_Emoy-induced cell death in N. benthamiana. 
The assay was performed as described in (D) after 
agro-infiltration of C-terminally HA-Strepll-tagged 
RPP1_WsB (RPP1_WsB-HS) with ATR1_Emoy2-HS. 
Statistical analysis via Tukey's HSD test is based on 
data from three independent experiments (n = 18, 
a = 0.001; data points of the same color were 
recorded in one experiment; shared lowercase 
letters indicate no significant difference). Below are 
photographs of representative agro-infiltrated leaf 
zones at 4 dai. Bottom: Western blot analysis of 
total N. benthamiana leaf protein extracts at 

2 dai probed with antibody to HA. Expression of 
RPP1_WsB-HS WT and mutant proteins produces a 
signal of the expected molecular weight (~140 kD). 
Ponceau S staining indicates equal loading of total 
leaf proteins on the blot. 


of RPP1N®” from an adjacent protomer via 
extensive interactions of C-terminal parts of 
helices aA and aE (Fig. 3C and fig. S5C). Less 
tight interactions are formed between TIRb 
and its adjacent RPPIN®” (fig. S9D). As ob- 
served in the ZARI resistosome, interactions 
between two neighboring LRR domains (fig. 
S9, A and E) likely further stabilize the RPP1 
tetrameric resistosome. 

We next investigated whether formation of 
the RPP1 resistosome is required for NADase 
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activity. A high-performance liquid chroma- 
tography (HPLC) assay with recombinant pro- 
teins purified from insect cells showed that the 
RPP1-ATR1 complex, but not RPP1 alone, 
hydrolyzed nicotinamide adenine dinucleotide 
(NAD*) at 1.0 uM protein (Fig. 3D). Divalent 
ions were shown to be important for nucleo- 
side hydrolase activities (29). We therefore 
tested whether Mg”* or Ca?* changed RPP1- 
ATRI tetramer NAD* hydrolysis. Addition of 
10 mM Mg”* strongly promoted consumption 
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of NAD* by RPP!1 (Fig. 3D). The same con- 
centration of Ca?* had a weaker effect. By 
contrast, Mg°* or Ca” did not increase the 
negligible NADase activity of RPP1 alone 
(Fig. 3D). These data show that assembly of 
the RPP1 tetramer is necessary for NAD* 
hydrolysis. Notably, the same concentration 
(1.0 uM) of RPP1'** (residues 60 to 254) puri- 
fied from insect cells was much less efficient 
in NAD* hydrolysis (fig. S10A). At a higher 
concentration (70 1M), RPP1'"* NADase activity 
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Fig. 3. The RPP1 resistosome, but not RPP1 alone, has Mg**/Ca**-dependent NADase activity. (A) A 
lateral RPP1 homodimer shown in cartoon. aA and aÈ are helices from the TIR domain (TIRa) in the left RPP1 


protomer that interact with RPP1N®? 


of the right protomer. The dashed line in the right RPP1 indicates the 


flexible region C-terminal to the TIR domains. (B) Detailed interactions of the B2-a2 loop from the right 
protomer with RPP1N®° and RPP1“"? from the left protomer highlighted within the blue frame in (A). 

(C) Detailed interactions between RPP1''®? and RPP1“®™ highlighted within the red frame in (A). (D) NADase 
activity assay of the ATR1-RPP1 complex and RPP1 alone. The purified ATRI-RPP1 complex or RPP1 only was 
incubated with 100 uM NAD* in buffer with or without 10 mM Mg**/Ca**. After incubation at 25°C for 

16 hours, reaction mixtures were centrifuged and immediately used for HPLC analysis. Reaction completion 
(%) of each sample was calculated as [1 - (concentration of unhydrolyzed NAD*)/(concentration of NAD* 


before reaction)| x 100%. 


was increased and promoted by Mg”* and 
Ca?* (fig. S10A), which suggests that high 
concentrations of TIR alone may drive it 
into oligomers with enzymatic activity (5). 
This notion is further supported by an earlier 
finding of RPP1"® in vivo autoactivity that 
correlated with its self-association in solution 
(30). Also, induced TIR domain proximity led 
to cell death in planta (37). Collectively, our 
data indicate that ATR1-induced assembly of 
the RPPI1 resistosome is required for RPP1 
NADase activity and host cell death induc- 
tion. Thus, the RPP1 resistosome can be viewed. 
as a pathogen-inducible holoenzyme for NAD* 
hydrolysis. 


Active sites are formed by asymmetric 
TIR homodimers 


Whereas the four RPP1X°” modules in the 
RPP1 resistosome are approximately related 
with C, symmetry, the four TIR domains are 
related with C, symmetry because of the dif- 
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ferent positioning of two neighboring RPP1"""s 
(Fig. 4A). Thus, the tetrameric RPP1""*s con- 
tain two symmetric TIR homodimers that 
are nearly identical to those observed in the 
crystal structure of RPP1"!* (fig. S11, A and 
B) (18). Each of three tested substitutions of 
residues at this homodimeric “AE” interface 
(9, 17, 18) abolished ATR1-induced cell death 
in N. benthamiana (fig. S11C), supporting an 
essential role of the AE interface in RPP1 func- 
tion. Functional relevance of the AE interface 
has been observed for TIR domains of other 
plant TNLs (9, 17, 18). In the RPPI1 tetramer, 
opposite packing of the two symmetric TIR 
homodimers led to formation of two asymmetric 
head-to-tail RPP1'* homodimers (Fig. 4A). 
The two RPP1''%s in an asymmetric homo- 
dimer have different conformations in the 
loop between gA and aB (equivalent to the 
BB-loop of other TIR domains, hereafter 
called the BB-loop) (Fig. 4B). The BB-loop 
is well defined in RPP1''** but is disordered 
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in RPP1'!® (fig. S12A). Asymmetric head-to- 
tail RPP1'"*-RPP1'"* interaction is primarily 
mediated by the BB-loop of TIRa that con- 
tacts the opposite side of TIRb (Fig. 4, A and 
C, and fig. S12B). 

From a sequence alignment (fig. S12C), the 
predicted catalytic RPP1""® is located at one 
end of the groove formed within an asymmetric 
RPP1""® homodimer (fig. S13A), which suggests 
that this groove is important for RPP1-catalyzed 
NAD* hydrolysis and RPP1 function. To test 
this hypothesis, we mutated residues from the 
interface of the asymmetric RPP1'"* homo- 
dimer and evaluated the impact of these sub- 
stitutions on the NADase activity of the RPP1 
resistosome and ATRI-induced cell death in 
N. benthamiana. Substitutions RPP1 1121E, 
S124E, A222E, or G223A, which are predicted 
to disrupt the asymmetric RPP1'!*® homo- 
dimers, all interacted with ATRI (fig. S10B) 
but were strongly compromised in NAD* 
hydrolysis (Fig. 4D and fig. S13B). By con- 
trast, an RPP1R123A exchange had less ef- 
fect on NADase activity (Fig. 4D). In support 
of an essential role of NADase activity in RPP1 
function, we also found that RPP1 1121E, 
A222E, E158A, or E158Q, but not RPP1 R123A, 
S124E, E122A/R123A/S124A/K125A/S126A, or 
G223A, displayed almost undetectable cell 
death activity (Fig. 4E). Taken together, these 
data show that the assembly of two TIRa- 
TIRb active sites in the RPP1 resistosome is 
responsible for NAD* hydrolysis and RPP1- 
mediated signaling. 

A previous study showed that many single 
mutations in the TIR domain of the canonical 
Arabidopsis TNL RPS4 (RPS4™®*) disrupted 
the cell death activity of RPS4™*? in tobacco 
(13). Mapping the equivalent residues onto the 
RPP1 resistosome revealed that most of them 
cluster around the asymmetric TIRa-TIRb groove 
(fig. S13C). We found that adenosine triphosphate 
(ATP), which was supplemented during protein 
purification, bound to this groove in the absence 
of NAD” (fig. S13A) and that the ATP binding 
groove is conserved among Arabidopsis TNLs 
(fig. S12C). The bound ATP likely acts as an 
analog of NAD’ at the groove. Supporting this 
possibility, a structural comparison revealed 
that NAD* phosphate (NADP”) bound to the 
TIR domain of plant TNL RUNI (9, 17, 18) at a 
position similar to that of ATP in the RPP1 
TIRa-TIRb groove (fig. S13D). These results 
provide additional evidence for the biological 
relevance of the asymmetric RPP1'™® homo- 
dimers in the resistosome. 


ADP binds to the P-loop region of RPP1 
in the resistosome 


Previous studies demonstrated that structures 
of the NOD module from plant and animal 
NLRs are highly conserved in both inactive 
and active states (32). This is underscored here 
by the similar structures of RPP1X?” in the 
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RPP1 tetramer and ZARIN® in the ZARI 
B pentamer (fig. S4C). Furthermore, NLR acti- 
a vation involves conformational changes in 
the NBD relative to its C-terminal WHD, 
whereas no conformational change occurs 
in the WHD relative to its C-terminal portion, 
as demonstrated in activation of ZAR1, Apaf-1, 
and NLRC4 (8, 25-27). A modeling study using 
the inactive ZARI®” structure (33) as a tem- 
plate suggested that the LRR domain, in its 
pre-activation state, sequesters RPP1 from 
oligomerization, consistent with other NLR 
autoinhibition models (fig. $14). The modeled 
inactive RPP1 structure also suggested that 
ATRI binding would sterically clash with 
RPP1”? (fig. S14B), inducing conformational 
E changes in RPP1N®” for activation. Position- 
ing of inactive RPP1'" is difficult to predict 
because of the lack of a reliable template. 
However, overexpression of RPP1'™, but not 
RPP NPY induced cell death in N. benthamiana 
(12, 30). These data point to inhibition of 
RPP1'!8 by RPP1IN©”, presumably through 
interdomain interaction similar to that ob- 
served for the inactive ZARI™ (33). An inhib- 
itory interaction between the TIR and NBD 
domains was also suggested for TNLs L6 and 
L7 from flax (34). 
On the basis of current models (5, 32), we 
Ce K PVP Ss S expected an ATP molecule to be bound by 
wr oe? Qs © the NBD of activated RPP1. Surprisingly, an 
ee adenosine diphosphate (ADP) molecule, which 
is unambiguously defined by the cryo-EM 
density, binds to the P-loop of RPP1 in the 
D ~~ RPP{variants+ATR1 resistosome (Fig. 5A and fig. S5D). The ADP is 
RPP1-ATR1 (1 uM) recognized via RPP1 residues that are highly 
NAD* (100 uM) conserved in other NLR proteins (33, 35-37). 
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or polar residue, creating “TTE/Q” in many 
Fig. 4. Assembly of two active sites by two asymmetric RPP1"'® dimers in the tetrameric RPP1 resist- | known TNLs including RPP1 (fig. S15, A and 
osome. (A) Tetrameric assembly of RPP1''®s in the resistosome. Some secondary structural elements are B). In contrast to ZAR1 and Apaf-1, however, 
labeled. Two bound ATP molecules within the two asymmetric RPP1''® dimers (top and bottom dimer) are | RPP1 tolerates such substitutions, which sug- 


shown in stick. The loop region between aA and aB (BB-loop) from TIRa is framed in red and shown in gests that other interactions might compen- 
blue. (B) Structural alignment of the two RPP1''® monomers from the asymmetric homodimer. The BB-loop | sate for loss of ATP-mediated stabilization of 
from TIRa and TIRb is shown in blue and gray, respectively. The BB-loop is highlighted. (C) Detailed the RPP! resistosome. Indeed, the B2-a2 loop 
interactions of the BB-loop from TlRa with TIRb in the asymmetric dimer. (D) Mutagenesis analysis of the contributes to RPP! oligomerization by mediat- 
interactions shown in (C) using the NADase activity assay described in Fig. 3D. (E) Effects of mutations at sites | ing NOD-NOD interactions (Fig. 3B and Fig. 
mediating TlRa-TIRb interactions (AE interface) on RPP1_WsB-HS and ATR1_Emoy2-dependent cell death in 5B). In further support of this hypothesis, the 


N. benthamiana leaves. Host cell death was measured in a quantitative electrolyte leakage assay as described in | TNL Roql with the TT/SR motif has ATP 
Fig. 2, D and E. Top: Host cell death triggered by coexpression of RPP1_WsB-HS variants and ATR1_Emoy2-HS bound in its activated form and the B2-02 
in leaves of N. benthamiana. Statistical analysis via Tukey's HSD test is based on data from three independent loop is not involved in formation of the Roq1 
experiments (n = 18, a = 0.001; data points of the same color were recorded in one experiment; shared lowercase | resistosome (23), similar to what was observed 
letters indicate no significant difference). RPP1 mutants E158A and E158Q were included as additional negative in the ZARI1 resistosome (Fig. 5, B and ©). 


controls. Below are photographs of representative agro-infiltrated leaf zones at 4 dai. Bottom: Western blot Besides Rogl, some other TNLs also carry 
analysis of total N. benthamiana leaf protein extracts at 2 dai probed with antibody to HA. Expression of the TT/SR motif (fig. SI5A). Most of these 
RPP1_WsB-HS WT and mutant proteins produces a signal of the expected molecular weight (~140 kD). Ponceau | TNLs have a shorter B2-a2 loop relative to 
S staining indicates equal loading of total leaf proteins in the tested samples. those without the motif (fig. SI5A), providing 
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Fig. 5. Activation of RPP1 by ATR1. (A) ADP is bound between the NBD and HD1 domains in the cryo-EM 
structure of RPP1-ATR1. Red dashed lines represent polar interactions. A corresponding structure with EM 
density around the ADP-bound molecule is shown in fig. S5D. (B) Structural superposition of a lateral 
RPP1N°? (cyan), Rol’ (pink), and ZAR1N°? (gray) homodimer. For clarity, only the NBD from the left 
NOD monomer is shown. The B2-a2 loop is shown in blue and framed in red. (C) Structural superposition of 
RPP1N°° (cyan), Rogl%®” (pink), and ZARIN°?. Arg®*? and Arg®?” from the TTR motif of Rogl and ZARI, 
respectively, and Glu”? from the equivalent position of RPP1 are indicated. The bound ADP in RPP1 and 
ATP in Roql and ZARI1 are shown in stick. (D) ATPase activity assay for RPP1E8^ and the RPP1©°°"-ATR1 
complex with ATR1. The recombinant proteins were individually incubated with ATP (1.0 mM) in buffer 
containing 10 mM Mg**. After incubation at 25°C for 1 hour, samples were centrifuged and immediately 
used for HPLC analysis to measure ATP content. The vertical axis represents the ATP ratio after 


versus before the reaction. 


additional evidence for the above hypothesis. 
Collectively, these results might explain why 
ADP is bound in the RPP! tetramer, although 
exchange of ADP with ATP during RPP1 
activation remains possible. 

Bound ADP in the RPP1 resistosome might 
result from RPP1 intrinsic ATPase activity. To 
test this possibility, we used HPLC to measure 
the ATPase activity of the RPP1**“ resisto- 
some or nonactivated RPP1*°*“ (to avoid 
potential interference from RPP1 NADase 
activity). In the presence of 10 mM Mg”, 
RPP1®!°*“ alone displayed ATPase activity 
that was more potent than that of Apaf-1 by a 
factor of ~3 to 4 (Fig. 5D and fig. S16) (35, 39). 
In contrast, the ATR1-activated RPP1*°8“ 
resistosome had much lower ATPase activity 
under the same conditions. Similar results 
were obtained with the Apaf-1 apoptosome 
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(39). Hence, we propose that RPP1 ATPase 
activity also contributes to an ADP-bound 
resistosome. 


Discussion 


We reconstituted the RPP1 resistosome and 
solved its structure using cryo-EM at 3.16 A. 
The structure reveals the mechanism of race- 
specific recognition of ATR1 by RPP1, leading 
to NLR conformational activation. We have 
presented evidence for the multilayered regu- 
lation of RPP1 tetramerization, including 
ATRI binding (Fig. 2), RPP1‘°” oligomeriza- 
tion (fig. S9), and RPP1'" self-association 
(Fig. 4). RPP1 tetramerization results in 
the formation of two asymmetric RPP1''* 
homodimers, creating active sites for NAD* 
hydrolysis. Therefore, ATR1-induced receptor 
tetramerization links effector binding to the 
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regulation of RPP1 NADase activity. Structural 
and biochemical characteristics of the activated 
RPPI tetramer provide a template for under- 
standing canonical plant TIR-type NLRs. 

Besides the canonical TNL domains, the 
C-JID that could not be predicted by primary 
sequence was revealed in the cryo-EM struc- 
ture of the RPP1 resistosome. Biochemical and 
functional data show that this novel structural 
domain is a major determinant for specific 
ATRI recognition (Fig. 2). In contrast to inte- 
grated domains of plant NLRs that are suffi- 
cient for effector recognition (7), RPP1°7!? 
functions together with RPP1'** for specific 
recognition of ATR1. The sequence-diversified 
RPP“ is predicted by a hidden Markov 
model (HMM) to be shared by many TNLs in 
dicotyledonous plant species (fig. S17, A to F). 
It comprises part of a C-terminal domain in 
Arabidopsis TNL receptor RPS4 that confers 
effector-triggered immunity as a heterodimer 
with TNL RRSI (17, 40). Mutations within the 
predicted RPS4°"" (fig. S17E) disabled RRS1/ 
RPS4 immunity (40, 41), pointing to broader 
importance of the C-JID for TNL function. It is 
possible that the C-JID serves as a decoy that 
mimics common virulence targets of pathogen 
effectors (7). However, using HMM, we failed 
to detect the RPP1°""” in CNLs and non-NLR 
plant proteins (fig. S17, B to D). We therefore 
speculate that the C-JID in different TNLs 
plays a more generic role, together with LRRs, 
in TNL-specific detection of unrelated patho- 
gen effectors. Although direct RPP1 recogni- 
tion of ATR1 leads to resistosome formation, 
many plant NLRs perceive their cognate ef- 
fectors in an indirect manner typically involv- 
ing other host proteins (2). 

Assembly of NADase active sites is medi- 
ated principally by the TIR BB-loop, which 
undergoes a major conformational change af- 
ter RPP1 tetramer formation (fig. S18A). The 
BB-loop of RRS1'"* (four residues) is shorter 
than that of RPP1'"® and other TNL TIR do- 
mains (>9 residues; fig. S18B). The RRS1'"* 
BB-loop in the symmetric RRS1'* homodimer 
or the RPS4!®-RRS1""* heterodimer (7) can- 
not support an RPP1'"*-like asymmetric homo- 
dimer (fig. S18B). Therefore, RRS1"'* would 
sequester the symmetric RPS4/"® homodimer 
from self-associating into asymmetric homo- 
dimers, potentially explaining RRS1‘"*-mediated 
in planta suppression of RPS4''®-triggered 
cell death (17. Structural alignment reveals 
that the asymmetric RPP1'"* homodimer is 
similar to that of filaments formed by MAL"!® 
in animals (fig. SI8C) (42). However, there is 
no experimental evidence for a filament- 
forming activity of RPP1™ or other plant 
TIR domains. Notably, MAL™? can form 
cofilaments with TIR domains of other pro- 
teins such as human TLR4 and MyD88 (42). 
Many TIR-only genes are encoded in the ge- 
nome of Arabidopsis (43), and TNLs have 
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been proposed to recruit TIR-only proteins 
for signaling (44, 45). It will be interesting to 
determine whether the RPPI1 resistosome can 
serve as a platform for interactions with TIR- 
only proteins or other oligomerized TNLs to 
amplify TNL-mediated immune signals. 

Like the ZARI1 resistosome (8), the Apaf- 
1 apoptosome (27), and the NLRC4 inflamma- 
some (25, 26, 46), the RPP1 tetramer forms a 
ring-like structure, despite the different oligo- 
merization states of its components. The ZAR1 
and RPP! resistosomes contain, respectively, a 
stabilized funnel-shaped structure for plasma 
membrane targeting and a stabilized tetra- 
meric RPP1""* for NADase activity. This con- 
trasts with relatively flexible apoptosome and 
inflammasome N-terminal CARD domains. 
Hence, the N-terminal domains of plant and 
animal NLRs likely engage different mecha- 
nisms for immune signaling. Nonetheless, a 
shared principle in signaling mediated by the 
ZARI resistosome and NLRC4 inflammasome 
was proposed (8). The ZAR1 resistosome 
and probably many other CNLs with an N- 
terminal “MADA” motif (47) might form a 
membrane-resident pore or channel to medi- 
ate immunity. This is conceptually analogous 
to signaling mediated by the NLRC4 inflam- 
masome that activates the pore-forming pro- 
tein GSDMD for immune responses through 
the protease caspase-1 (48). The RPP1 resist- 
osome, as a holoenzyme, bears similarity to 
both the apoptosome and the inflammasome, 
which form holoenzymes after recruitment of 
procaspase-9 and procaspase-l, respectively. 
Building on current models of TNL-mediated 
immune signaling, the RPPI1 resistosome 
NADase activity is responsible for activa- 
tion of HeLo domain-containing signaling 
(helper) NLRs via the EDS1 family of lipase- 
like proteins (49-51). It was speculated that 
HeLo-NLRs function similarly to ZARI at 
host membranes (8, J, 47). Thus, the RPP1 
resistosome and NLRC4 inflammasome might 
represent comparable paradigms in innate 
immunity signaling. 


Materials and methods 
Protein expression and purification 


Sequence alignment of RPP1_WsB with its 
alleles RPP1_WsA, RPP1_NdA, RPP1_EstA, and 
RPP1_ZdrA by Clustal Omega (52) indicated 
that the N-terminal 60 amino acids of RPP1_ 
WsB are not conserved. Therefore, an N- 
terminally truncated form of WT RPP1_WsB 
(61-1221) was constructed for protein expres- 
sion in insect cells. ATR1_Emoy2 with an N- 
terminal truncation of 51 residues used for 
crystallization in a previous study (24) was 
used for protein expression. ATR1_Emoy2 
(residues 52 to 311) was sufficient for recog- 
nition by RPPI, as demonstrated (24). For 
purification of the RPP1-ATR1 complex, a 
codon-optimized RPP1_WsB (61-1221; GENEWIZ 
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Inc.) construct was cloned into the pFastBac 
1 vector (Invitrogen) with a C-terminal twin- 
StrepII tag. A codon-optimized ATR1_Emoy2 
construct (52-311; GENEWIZ) was cloned into 
the pFastBac 1 vector with a C-terminal 10 x 
HIS tag. These constructs were coexpressed 
in Sf21 insect cells (Invitrogen) at 28°C. After 
recombinant baculovirus infection for 48 hours, 
infected cells were harvested and resuspended 
in buffer A (25 mM Tris-HCl, pH 8.0, 150 mM 
NaCl, 1 mM MgCl,, and 0.3 mM ATP). After 
sonication and centrifugation, the RPP1-ATRI1 
complex was purified using Strep-Tactin resin 
(IBA Lifesciences) from the supernatant. Pro- 
teins bound to resin were eluted with buffer A 
supplemented with 2.5 mM p-desthiobiotin. 
The eluent from Strep-Tactin was loaded 
onto Ni-NTA resin (Novagen) and the resin 
was washed with buffer A containing 20 mM 
imidazole. Proteins bound to the Ni-NTA resin 
were eluted with buffer A plus 250 mM imi- 
dazole. The eluent was concentrated through 
a 30-kKD MWCO Vivaspin 500 concentrator 
(GE Healthcare) to 100 ul and loaded onto a 
Superose 6 increase 5/150 column (GE Health- 
care) with buffer E 0 mM Tris-HCl pH 8.0, 
100 mM NaCl, and 1 mM DTT). Peak fractions 
were concentrated to 0.45 mg/ml for cryo-EM 
sample preparation. A similar procedure was 
used for purification of the RPP1*°*“ (61- 
1221)-ATR1 (52-311) complex. 


Cryo-EM sample preparation and data collection 


An aliquot of 3 ul of purified RPP1-ATR1 was 
applied to holey carbon grids (Quantifoil Au 
1.2/1.3, 300 mesh) glow-discharged for 30 s at 
high level in Harrick Plasma after 2 min eva- 
cuation. Grids were then blotted on filter paper 
(Ted Pella Inc.) for 2.5 s at 8°C with 100% 
humidity and flash-frozen in liquid ethane 
using FEI Vitrobot Marked IV. 

Two datasets of the WI RPP1-ATR1 complex 
were collected: one on a Titan Krios2 electron 
microscope operated at 300 kV, equipped with 
Gatan K3 Summit direct electron detector and 
a Gatan Quantum energy filter, the other on a 
Titan Krios3 electron microscope operated at 
300 KV, equipped with a Cs-corrector, Gatan 
K3 Summit direct electron detector, and a 
Gatan Quantum energy filter. A total of 5701 
and 3834 micrograph stacks were collected on 
Titan 2 and Titan 3, respectively. The micro- 
graph stacks were automatically recorded 
using AutoEMation in superresolution mode 
(53), at a nominal magnification of 64,000 
on Titan 2 and 81,000~x on Titan 3. Defocus 
values varied from -1.0 um to -2.0 um for 
both datasets. Exposure rate of data collec- 
tion on Titan 2 and Titan 3 was 23 and 24 
electrons per pixel per second, respectively. 
The exposure time for both datasets was 2.56 s 
dose-fractionated into 32 subframes, leading 
to a total electron exposure of ~50 electrons 
per A? for each stack. 
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Image processing and 3D reconstruction 

The raw stacks of RPP1-ATR1 recorded in 
superresolution mode were motion-corrected 
by MotionCor2 and binned twofold, resulting 
in a physical pixel size of 1.0742 A per pixel 
for Titan 2 motion-corrected micrographs and 
1.0979 A per pixel for Titan 3 motion-corrected 
micrographs (54). In the meantime, exposure 
rate for the summed micrographs was per- 
formed (55). Contrast transfer function (CTF) 
parameters were estimated by CT FFIND4 
(56). Based on the CTF estimations, 5117 (Titan 
2)/3123 (Titan 3) micrographs were manually 
picked, respectively, and were further pro- 
cessed in RELION3.1. 

About 10,000 Laplacian-of-Gaussian auto- 
picked particles were subjected to 2D classifi- 
cation to generate initial templates for further 
auto-picking. For the dataset collected on 
Titan 2, 1,395,913 auto-picked particles were 
subjected to five rounds of 2D classification, 
which performed 25 iterations with regulari- 
zation parameter T = 2 and number of classes 
= 100 to remove bad particles. Similar param- 
eters were applied in three rounds of 2D clas- 
sification of 1,125,203 auto-picked RPP1-ATR1 
particles from Titan 3. After removal of bad 
particles, the remaining 661,434 particles on 
Titan 2 and 516,036 on Titan 3 for the RPP1- 
ATR1 complex were subjected to 3D classi- 
fication with C, symmetry, using initial 3D 
reference models obtained by ab initio calcu- 
lation from RELIONS.1. 

Particles (276,146 from Titan 2 and 133,202 
from Titan 3 for RPPI-ATR1) from good 3D 
classes with clear overall structural features 
were selected for 3D refinement. At this stage, 
the 3D reconstructions clearly showed four 
RPP1 molecules in the complex. C,, symmetry 
was therefore tested in the following 3D re- 
finement. Resulting reconstructions showed 
that the global density, particularly that cor- 
responding to the TIR part, became worse, 
indicating that lower or no symmetry exists 
in the RPP1-ATRI1 complex. Subsequent 3D 
refinement with C, symmetry greatly improved 
the density quality and global resolution. After 
global 3D refinement with C, symmetry, CTF 
refinement, and postprocessing, the resolution 
of the RPPI-ATR1 reconstruction from the 
Titan 2 and Titan 3 datasets was 3.65 A and 
3.16 A, respectively. Refined RPP1-ATR1 par- 
ticles (409,348) from the two datasets were 
joined and subjected to a further round of 3D 
reconstruction, yielding a final cryo-EM map 
with 3.16 A resolution. 

A core region mask including the TIR, NBD, 
HD1, and WHD domains of RPP1 and ATRI 
was generated by Chimera and then applied to 
3D auto-refinement using C, symmetry and 
409,348 particles from the final reconstruction 
with the merged datasets. In the end, the 
resolution of RPP1 core part reconstruction 
was 2.99 A after postprocess. 
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The LRR and C-JID portions in the final EM 
map were more flexible relative to the other 
parts of the RPP1-ATR1 complex. To improve 
resolution of the local density, C,, symmetry 
was first used to refine the joined particles, 
and the refined particles were expanded with 
C,, symmetry for focused 3D classification. A 
local mask for LRR, C-JID, and ATR1 was 
generated using Chimera and then applied for 
focused 3D classification without alignment as 
described (57). After the focused 3D classifica- 
tion, a total of 222,015 particles were selected 
for C; focused 3D refinement, finally yielding a 
local reconstruction for LRR, C-JID, and ATRI 
with 3.19 A after postprocess. 

2D classification, 3D classification, and 3D 
autorefinement were all performed with 
RELIONS.1 (58-60). The resolutions were 
determined by gold-standard Fourier shell 
correlation (67). Local resolution distribution 
was evaluated using RELION (62). 


Model building and refinement 


The final RPP1-ATR1 EM map was generated 
by merging the global map and local LRR, 
C-JID, and ATR1 map, using combine_focused_ 
map in PHENIX (63). For model building of 
the whole RPP1-ATR1 complex, the RPP1 TIR 
domain (PDB: 5TEB) (J8) and ATRI (PDB: 
3RMR) were docked into the EM map in 
Chimera (64). The models of the NBD, HD1, 
and WHD domains of RPP1 were manually 
built in COOT based on the global EM map of 
RPP1-ATRI1, and the LRR and C-JID domains 
in COOT using the local refined EM map (65). 
All the domains were then combined, gen- 
erating a model containing four ATR1 mole- 
cules and four RPP1 molecules. The generated 
model was refined against the combined RPP1- 
ATRI1 EM density using real-space refinement 
in PHENIX with secondary structure and ge- 
ometry restraints (63). The final model after 
refinement was validated using MolProbity 
and EMRinger in the PHENIX package (63). 
Table S1 summarizes the model statistics. 


In vitro NADase assays 


Purified RPP1, RPP1-ATR1, mutant RPP1-ATR1, 
or RPP1"'* were used for NADase assays at the 
indicated concentrations. Proteins were indi- 
vidually incubated with 100 uM NAD” (final 
concentration) and 10 mM MgSO, or CaCl, in 
buffer containing 100 mM NaCl, 25 mM Tris- 
HCI pH 8.0. The total volume for each reaction 
was 100 ul. Reactions were performed in a 
thermoshaker at 25°C for 16 hours. After re- 
action, samples were centrifuged and imme- 
diately applied for HPLC analysis. 


In vitro ATPase assays 


Purified RPP1’?*“, RPP1™°**-ATR1 complex, 
and ATRI were used for ATPase assays with a 
protein concentration of 5 uM. Each protein 
was incubated with 100 uM ATP (final concent- 
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ration) in buffer containing 10.0 mM Mgso,, 
100 mM NaCl, 25.0 mM Tris-HCl pH 8.0 at 25°C 
for the indicated times. The volume for re- 
action was 100 ul. After reaction, samples 
were centrifuged and immediately applied 
for HPLC analysis. 1.0 uM RPP1*?* was 
used to quantify its ATPase activity. Reac- 
tions were carried out as described above. 
Samples (50 ul) taken at different time points 
(1, 2, 3, 4, and 5 hours) were analyzed on an 
HPLC system to detect the formed ADP. By 
comparison with reference samples (ADP), the 
respective integrated ADP peak area was 
converted into concentration. The ATPase ac- 
tivity was calculated by five data points—based 
linear regression. 


HPLC measurements 


HPLC was performed on an Agilent 1260 
bioinert HPLC system using a Synergi Fusion- 
RP 80 A (4.6 x 150 mm, 4 um) (Phenomenex) 
column. The samples were measured via an 
8-min method. Samples (10 ul) were injected 
at 550 wl/min with ammonium formate (5 mM) 
in water and methanol used as mobile phases 
A and B, respectively. The elution profile was 
as follows: 0 to 3 min, 10 to 70% B; 3 to 6 min, 
70% B; 6 to 6.1 min, 70 to 10% B; 6.1 to 8 min, 
10% B. The autosampler temperature was 
maintained at 4°C and the column temper- 
ature at 25°C. UV signals were detected at 
260 nm. Reference standards were used to 
determine respective retention times. The 
integrations of peak area were used to cal- 
culate relative concentrations. 


Site-directed mutagenesis of RPP1_WsB and 
ATR1_Emoy2 for in planta analyses 


For N. benthamiana transient expression 
analyses, the RPP1_WSsB gene body (exons 
and introns) was PCR-amplified from the 
pENTR/D-TOPO pRPP1:gRPP1 construct pro- 
vided by K. Krasileva (12, 19) and cloned into 
PENTR/D-TOPO vector (Thermo Fisher Sci- 
entific, K240020). The ATR1_Emoy2 sequence 
starts with a codon corresponding to T19 
in NCBI accession AAX51198 ATRI_Emoy?2, 
thereby removing a signal peptide. Muta- 
genesis was performed using a QuikChange 
II site-directed mutagenesis kit (Agilent, 
200523) or KOD-Plus-Mutagenesis kit (CosmoBio, 
SMK-101). Sequences of oligonucleotides are 
provided in table S2. Obtained pENTR/D-TOPO 
RPP1_WsB and pDONR207 ATR1_Emoy?2 plas- 
mids were LR-recombined (Thermo Fisher 
Scientific, 11791020) into pXCSG vectors (66) 
to allow expression of untagged RPP1”*?, 
C-terminally tagged RPP1_WsB-3xHA-StrepII, 
and ATR1_Emoy2-3xHA-StrepII proteins un- 
der a 35S promoter. All constructs were veri- 
fied by DNA sequencing. Generated binary 
constructs were transformed into Agrobacte- 
rium tumefaciens (Rhizobium radiobacter) 
GV3101 pMP9ORK via electroporation. 
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Protein expression in N. benthamiana and 
Western blot analysis 

RPP1_WsB-HS and ATR1_Emoy2-HS were ex- 
pressed in N. benthamiana using agrobacteria- 
mediated transient expression assays in the 
presence of the P19 suppressor of RNAi si- 
lencing as in (49). The final ODgo9 for each 
strain was Set to 0.2. To detect ATRI variants, 
two 8-mm leaf discs per sample were har- 
vested at 2 dai and boiled at 95°C in Laemmli 
buffer for 5 min. The ATR1-HS fusions were 
blotted onto nitrocellulose membranes and 
detected using primary a-HA antibodies (Sigma 
Aldrich, 11867423001 or H6908; Cell Signaling 
Technology, #3724) and secondary horseradish 
peroxidase (HRP)-conjugated antibodies (Santa 
Cruz, sc-2006; Sigma Aldrich, A5164) at di- 
lution 1:5000 (3% milk powder in TBST). 
Detection was performed using enhanced 
chemiluminescence assays Clarity, Clarity 
Max Western ECL (Bio-Rad, #1705061 and 
#1705062). To detect RPP1_WSB-HS var- 
lants, infiltrated N. benthamiana leaves were 
collected at 2 dai, frozen in liquid nitrogen, 
and ground to a fine powder. Powder (~100 ul 
in a tube) was resuspended in 100 ul of urea- 
SDS sample buffer [50 mM Tris-HCl pH 6.8, 
2% SDS, 8 M urea, 2% B-mercaptoethanol, 
5% glycerol, protease inhibitor cocktail 
(Roche), and 0.004% Bromophenol Blue] 
and vortexed for 10 min at room temper- 
ature. No boiling step was included. After 
centrifugation at 16,000g for 10 min, 10 ul 
of the supernatant was loaded onto 8% 
SDS-PAGE and proteins were blotted onto a 
PVDF membrane. Immunoblot assay was per- 
formed using monoclonal rat anti-HA anti- 
body (Sigma Aldrich, 11867423001) diluted 
1:4000 and rabbit anti-rat antibody (Sigma 
Aldrich, A5164) diluted 1:5000 in 1x TBS, 
0.1% Tween-20 with 3.5% w/v nonfat dry milk. 
RPP1_WsB-HS fusion proteins were detected 
using ECL SuperSignal West Femto Maxi- 
mum Sensitivity Substrate and ECL Western 
Blotting substrate (Thermo Scientific) in a 
ratio of 2:1. 


Cell death quantification in N. benthamiana 


RPP1_WsB-HS and ATR1_Emoy2-HS WT and 
mutant protein combinations were transiently 
expressed in N. benthamiana as described 
above (49) and agrobacteria-infiltrated leaf 
zones used for cell death (ion leakage) assays 
at 3 dai as described (49). Statistical analysis 
was performed on conductivity data normal- 
ized to the mean level in samples containing 
WT RPP1_WsB and ATR1_Emoy2 samples via 
Tukey's HSD (honestly significant differ- 
ence) test after checking normality of residuals 
distribution and homogeneity of variance 
using visual examination of the plots and 
Shapiro-Wilcoxon and Levene tests (P > 
0.05). Images of agrobacteria-infiltrated leaf 
spots were taken at 4 to 5 dpi. 
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RPP1_WSB jelly roll/lg-like (JID) sequence 
distribution across plants 

Protein sequences of 32 representative plant 
species from green algae to Arabidopsis 
thaliana were obtained as in (49). Sequences 
similar to the jelly roll domain of RPP1_WsB 
(amino acids 1075 to 1195) were initially 
searched with BLASTP (-evalue 0.01). The 
jelly roll-like sequences were detected at 
this step only in Brassicaceae plants. The 
match sequences were extracted and aligned 
via the Muscle method. A hidden Markov 
model was built from the resulting align- 
ment [hmmbuild in HMMER 3.1b2 (67), 
default settings]. This model (v1) identified 
622 matches mainly in Brassicales but also 
in Fabales, Malpighiales, and Rosales (Ref- 
erence Proteomes 2020 _04, hmmsearch at 
EMBL-EBI-incE 0.01). To improve sensitiv- 
ity of the search, a next version of the HMM 
model was prepared. For that, the vl HMM 
was run against the 32 species protein data- 
base above (hmmsearch in HMMER 3.1b2,- 
incE 0.01) and not against public databases 
to avoid with redundancy and skewing toward 
Brassicaceae in public databases. The result- 
ing matches were again extracted, aligned 
with Clustal Omega, and the obtained align- 
ment served as an input for building the 
version 2 HMM, available as data S1. The v2 
HMM run against Uniprot database (2019-10- 
03) at EMBL-EBI (hmmsearch -El-domE 1- 
incE 0.001-incdomE 0.03-seqdb uniprotkb) 
identified 2711 hits, only in eudicots. The 
majority of found proteins have a typical TNL 
domain architecture, other hits have different 
alrrangements/combinations of NBARC, LRR, 
TIR domains and occasionally other domains. 
Finally, to assess distribution of the RPP1_WsB- 
like JID across plants, the v2 HMM (data S1) 
was scanned against the above custom non- 
redundant database of protein sequences from 
32 representative plant species (hmmsearch- 
incE 0.01). 
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Structure of the activated ROQ1 resistosome directly 
recognizing the pathogen effector XopQ 


Raoul Martin*, Tiancong Qi*, Haibo Zhang, Furong Liu, Miles King, Claire Toth, 


Eva Nogales{, Brian J. Staskawiczt 


INTRODUCTION: Plants and animals respond to 
pathogen invasion through intracellular nucleotide- 
binding leucine-rich repeat receptors (NLRs) 
that directly interact with pathogen proteins 
or indirectly detect pathogen-derived alterations 
in the host proteome. Upon recognition of 
pathogen invasion, NLRs trigger an immune 
response that resolves in a variety of ways 
depending on the type of NLR being activated. 
The overall architecture of NLRs is highly con- 
served, consisting of a C-terminal leucine-rich 
repeat (LRR) platform that determines substrate 
specificity and a central nucleotide-binding 
oligomerization domain. The N-terminal domain 
varies between NLRs and determines the mech- 
anism used by the host to activate the immune 
response. Thus, NLRs in plants have been clas- 
sified according to their N-terminal domain 
into Toll/interleukin 1 receptor (TIR) NLRs 
(TNLs), coiled-coil NLRs (CNLs), and RPW8- 
like coiled-coil NLRs (RNLs). Pathogen detec- 
tion and oligomerization of the NLR activates 
these N-terminal domains by bringing them in 
close contact. In all three cases, association of 
the N-terminal domain leads to localized cell 
death and expression of disease resistance. The 
TIR domains of TNLs have been shown to have 
oligomerization-dependent NADase activity that 
is required for promoting cell death, but it is not 
understood how the interactions between TIR 
domains renders them catalytically active. 


RATIONALE: The structure of the ROQ1 (recog- 
nition of XopQ 1)-XopQ (Xanthomonas outer 
protein Q) complex, an immune receptor bound 
to its pathogen substrate, was used as a model 
to study the mechanism of direct binding, oli- 
gomerization, and TIR domain activation of 


Proposed mechanism of ROQ1 
activation. The LRR and C-JID of 
ROQI recognize the pathogen effec- 
tor XopQ. ROQI oligomerizes 
through the NB-ARC domain (NBD, 
HD1, WHD) in an ATP-bound state. 
TIR domain association causes a 
conformational rearrangement of the 
BB-loop and opens the NADase active 
site. Catalytic activity of the TIR 
domains further signals the immune 
response, resulting in cell death. 


Martin et al., Science 370, 1185 (2020) 


Recognition 
of XopQ 





TNLs. ROQ1 has been shown to physically in- 
teract with the Xanthomonas effector XopQ, 
causing it to oligomerize and trigger a TIR- 
dependent hypersensitive cell death response. 
We coexpressed, extracted, and purified the as- 
sembled ROQI-XopQ complex from ROQIT’s 
native host, Nicotiana benthamiana, and solved. 
its structure by cryo-electron microscopy to 
3.8-A resolution. The interactions described in 
our structure were further confirmed by in vivo 
mutational analysis. 


RESULTS: Our structure reveals that ROQI 
forms a tetrameric resistosome upon recogniz- 
ing XopQ. The LRR and a post-LRR domain 
named the C-terminal jelly-roll/Ig-like domain 
(C-JID), form a horseshoe-shaped scaffold 
that curls around the pathogen effector, 
thereby recognizing multiple regions of the 
substrate. Binding of the ROQ1 LRR to XopQ 
occurs through surface-exposed residues that 
make up the scaffold of the domain, as well as 
an elongated loop between two LRRs that 
forms a small amphipathic o-helix at the site 
of interaction. The mode of substrate recog- 
nition by the C-JID is reminiscent of that used 
by immunoglobulins to bind their antigen. 
Similar to the complementary-determining 
regions of antibodies, interconnecting loops 
emerging from the C-JID B-sandwich struc- 
ture make substrate-specific contacts with 
XopQ. In particular, an extended loop of 
the C-JID dives into the active-site cleft of 
XopQ and interacts with conserved resi- 
dues required for nucleoside binding, sug- 
gesting that ROQI not only recognizes its 
substrate but also inhibits its ligand-binding 
function. 


ROQI 
oligomerization 
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TIR domain 
association 





TIR TIR 
dimer dimer 


The nucleotide-binding domain (NBD), heli- 
cal domain 1 (HD1) and the winged-helix do- 
main (WHD), termed NB-ARC because of their 
presence in Apaf-1, R proteins, and CED-4 
(ARC), are responsible for ROQ1 oligomer- 
ization in an ATP-bound state. Individual 
protomers intercalate in a similar fashion as 
found in other NLR structures, promoting as- 
sociation between the N-terminal TIR domains. 
The TIR domains bind to each other through 
two distinct interfaces (called AE and BE), 
causing them to form a dimer of dimers. BE- 
interface contacts cause a conformational re- 
arrangement in a loop, called the BB-loop, 
at the periphery of the TIR domain active site 
that exposes the putative catalytic glutamate 
that is suggested to cleave NAD”. These results 
provide a rationale for the previously deter- 
mined oligomerization dependence of TIR 
domain NADase activity. 


CONCLUSION: We propose a step-by-step mech- 
anism for ROQ1 immune signaling based on 
our structure of the activated complex and 
on previous biochemical studies. The LRR and 
C-JID of ROQI] recognize the pathogen effector 
through direct contacts with its surface and 
active-site residues. Detection of the substrate 
releases autoinhibitory contacts between the 
NB-ARC domain and the LRR, allowing the 
NB-ARC domain to transition to an ATP- 
bound, oligomerization-prone state. Com- 
plex assembly brings the TIR domains in close 
contact, leading to opening of the NADase ac- 
tive site in an interface-dependent manner. 
Cleavage of NAD” by the TIR domain results 
in the release of adenosine diphosphate ri- 
bose, a signaling molecule that triggers cyto- 
solic Ca? influx, a widely used chemical cue in 
response to various biotic and abiotic stresses, 
leading to downstream activation of localized 
cell death and disease resistance. 
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recognizing the pathogen effector XopQ 


Raoul Martin'2*, Tiancong Qi***, Haibo Zhang’, Furong Liu®, Miles King”, Claire Toth, 
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Plants and animals detect pathogen infection using intracellular nucleotide-binding leucine-rich repeat 
receptors (NLRs) that directly or indirectly recognize pathogen effectors and activate an immune 
response. How effector sensing triggers NLR activation remains poorly understood. Here we describe 
the 3.8-angstrom-resolution cryo—electron microscopy structure of the activated ROQI1 (recognition 

of XopQ 1), an NLR native to Nicotiana benthamiana with a Toll-like interleukin-1 receptor (TIR) domain 
bound to the Xanthomonas euvesicatoria effector XopQ (Xanthomonas outer protein Q). ROQ1 directly 
binds to both the predicted active site and surface residues of XopQ while forming a tetrameric 
resistosome that brings together the TIR domains for downstream immune signaling. Our results 
suggest a mechanism for the direct recognition of effectors by NLRs leading to the oligomerization- 
dependent activation of a plant resistosome and signaling by the TIR domain. 


lants have a sophisticated and finely 

tuned innate immune system that re- 

cognizes invading phytopathogens to 

protect from infection and disease. Path- 

ogen recognition is facilitated by both 
membrane-anchored pattern recognition re- 
ceptors and intracellular innate immune 
receptors (/). The latter include the nucleotide- 
binding leucine-rich repeat receptors (NLRs) 
(2). Although some NLR immune receptors 
directly bind pathogen effector proteins, others, 
such as ZARI, monitor effector-mediated al- 
terations of host targets to activate effector- 
triggered immunity (ETTI) (3-5). ETI activation 
is often accompanied by localized cell death 
referred to as the hypersensitive response (HR). 
Animals also use NLR proteins as intracellular 
immune receptors to recognize potential path- 
ogens, and the NLR domain architecture is 
highly conserved, with each region playing a 
specific role in its mechanism of action (6). 
Plant NLRs generally consist of three domains: 
an N-terminal region that is either a coiled-coil 
(CC) domain or a Toll-like interleukin-1 recep- 
tor (TIR) domain, a central nucleotide-binding 
(NB) domain conserved in APAF-1, other 
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R-proteins, and CED-4 (NB-ARC), and the 
C-terminal leucine-rich repeat (LRR) domain 
(2). Plant NLRs are divided into TIR-NLRs 
(TNLs), CC-NLRs (CNLs), and RPW8-like 
CC (CCp)-NLRs (RNLs) based on their N- 
terminal domains, with experimental evidence 
consistently suggesting that oligomerization 
of the N-terminal domains is required for 
signal transduction and expression of disease 
resistance (3). 

Although the activation mechanism of a 
plant CNL resistosome has been elucidated 
(7, 8), the mechanism of TNL activation remains 
elusive. There is still no structural evidence 
for TNL resistosome formation. TIR domains 
of both plant and animal NLRs were reported 
to have a nicotinamide adenine dinucleotide 
(NAD*) nucleosidase activity that requires 
TIR domain oligomerization to trigger hyper- 
sensitive cell death (9, 10). Whether the 
NADase activity of the TIR domain is fully re- 
sponsible for ETI activation and why NAD* 
cleaving only happens in the presence of TIR 
self-association require further investigation. 

To further our understanding of the molec- 
ular events that control the direct recognition 
of pathogen effectors and activation of TNL 
immune receptors, we transiently coexpressed 
Xanthomonas euvesicatoria type III effector 
XopQ (Xanthomonas outer protein Q) and 
its TNL receptor ROQ1 (recognition of XopQ. 
1) in Nicotiana benthamiana edsI-1 mutant 
leaves, copurified them by sequential affinity 
chromatography, and solved a 3.8-A cryo- 
electron microscopy (cryo-EM) structure of the 
assembled protein complex. XopQ is highly 
conserved across various Xanthomonas spe- 
cies and has been shown to have nucleoside 
hydrolase activity, to physically interact with 
14-3-3 proteins of the plant host, and to sup- 
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press ETI (77). Nevertheless, the precise mech- 
anism of XopQ that promotes pathogen viru- 
lence remains unclear. Recognition of XopQ 
by ROQI has been shown to trigger down- 
stream ETI signal transduction, leading to a 
hypersensitive cell death response and resist- 
ance to pathogen invasion (11, 12). Our struc- 
tural data reveal that ROQ] directly binds XopQ 
to activate a tetrameric resistosome. We iden- 
tified a series of necessary contacts for XopQ 
recognition by ROQI and describe the struc- 
ture of a post-LRR (PL) domain that is es- 
sential in effector binding. We also describe 
the overall oligomeric state of ROQI and the 
interfaces formed by the NB-ARC domain in 
an open conformation. Finally, we provide an 
explanation for the requirements of oligomer- 
ization in TIR activation, which involves opening 
of the NADase active site in an interface- 
dependent manner. Together, our results pro- 
vide the structural basis for direct effector 
recognition, oligomerization, and activation of 
TNLs that reveals how these immune recep- 
tors detect pathogens and signal an immune 
response. 


Overall structure of the ROQ1 resistosome 


XopQ recognition by ROQI triggers a rapid 
cell death response in wild-type N. benthamiana 
leaves, making it difficult to obtain sufficient 
protein for expression and purification (JJ). 
All plant TNLs require the downstream EDS1 
protein to achieve cell death and express dis- 
ease resistance (13). To obtain live tissue for 
protein purification, we transiently coexpressed. 
ROQI and XopQ by Agrobacterium-mediated 
transformation in CRISPR-induced edsI-1 mu- 
tants of N. benthamiana known to prevent 
ROQI1-induced cell death (12). Cryo-EM imag- 
ing and analysis of the affinity-purified com- 
plex yielded a reconstruction at 3.8-A overall 
resolution with C4 symmetry imposed (figs. S1 
and S2) and showed that the ROQ1 protomers 
assemble into a tetrameric, four-leaf clover 
structure with XopQ present in a 1:1 ratio 
(Fig. 1). Further image processing was re- 
quired to allow building of atomic models 
(fig. S3). We found that the nucleotide-binding 
domain (NBD), helical domain 1 (HD1), and 
winged helix domain (WHD) provide the ne- 
cessary contacts for ROQ] oligomerization and 
bring together the four TIR domains. The LRR 
features the characteristic horseshoe shape 
and wraps around the XopQ effector protein, 
recognizing its surface residues. The cryo-EM 
map also reveals a PL domain at the C-terminal 
end of the LRR connected by a short 10-residue 
linker (Figs. LA and 2). To improve the density 
of ROQ1 bound to XopQ, we applied symmetry 
expansion and focused refinement around the 
LRR-PL-XopQ region (figs. S1 and S3). The im- 
proved reconstruction showed that the XopQ 
effector is in its open conformation, exposing 
the cleft of the predicted nucleoside hydrolase 
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Fig. 1. Overall structure of the ROQ1-XopQ complex. (A) Schematic representations of ROQ] and XopQ 
with color-coded domain architecture: TIR, yellow; NB-ARC NDB, HD1, and WHD, light green, green, and dark 
green, respectively; LRR, violet; C-JID (or PL domain), light blue; and XopQ, salmon. (B and C) Composite 
density map of the ROQ1-XopQ complex from three cryo-EM reconstructions (B) and corresponding atomic 
model (C) shown in three orthogonal views. Colors are according to the nomenclature in (A). 


active site (Fig. 2E). XopQ’s specific substrate 
remains unidentified, but previous studies have 
shown that XopQ binds adenosine diphosphate 
ribose (ADPR), an important immune signaling 
molecule in plants, consistent with its immu- 
nosuppressive function (74). 


Recognition of XopQ by ROQ1 


The 24 LRRs of ROQ1 form a 150-A-long scaf- 
fold that bends around XopQ, displaying key 
contact residues along its surface (Fig. 2). We 
found that the LRR of ROQ1 interacts with 
the effector in two different ways. First, in the 
region where XopQ is in close contact with the 
LRR scaffold, several side chains exposed on 
the surface of the LRR directly interact with 
the substrate (Fig. 2A). A similar mechanism is 
used by the LRRs of the CNL ZARI to re- 
cognize RKS1 and TLR3 to recognize double- 
stranded RNA (8, 15). Most of these residues 
have large aromatic side chains that recognize 
hydrophobic patches and grooves on the sur- 
face of XopQ. Second, in regions where the 
LRR scaffold is too far away to interact with 
the effector directly, we found an elongated 
linker between two LRRs (LRRs 23 and 24) 
that reaches over to bind XopQ (Fig. 2C). A 
small amphipathic a-helix is formed at the site 
of contact, with hydrophobic side chains re- 
cognizing conserved residues at the outer edge 
of XopQ’s active site cleft (Y311%°2, H433*°P) 
(Fig. 2C). The extended linker then loops back 
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toward the scaffold and forms the next repeat 
in the LRR. Mutating the residues that form 
the hydrophobic face of the a-helix (L1075%°@, 
w10798°@ 110788°*) to alanines resulted 
in loss of the HR phenotype, suggesting that 
these interactions are critical for XopQ recog- 
nition (fig. S4). 

A 10-residue linker (amino acids 1120 to 1129) 
connects the C-terminal end of the LRR to 
the PL domain, which also interacts with 
XopQ (Fig. 2, B and D). This domain folds into 
a B-sandwich, with nine antiparallel B-strands 
arranged into two f-sheets (fig. S5). The last 
LRR forms hydrogen bonds with one of the 
B-strands, thereby rigidifying the conforma- 
tion between the two domains. Because PLs 
serving in pathogen detection have been found 
in other TNLs but remain poorly characterized, 
we sought to further investigate the possible 
structural homology of the ROQ1 PL domain 
with published structures (16). Analysis using 
the CATH database (17) revealed proteins with 
immunoglobulin-like and jelly-roll folds as the 
closest structural homologs (fig. S5A). Further- 
more, the PL domain of RPP1, another TNL, 
shares a similar structure to the PL domain in 
ROQI (18) despite having low sequence iden- 
tity (14.29%) (fig. S5). The core of this domain 
in RPP1 also folds into a B-sandwich structure 
that forms hydrogen bonds with the last B-strand 
of the LRR (/8). Both share the same £-strand 
topology, with the exception of a ninth C-terminal 
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B-strand present in Roq! (fig. S5B). The major 
structural differences in the ROQ1 and RPP1 
PL domain are found in the loops that inter- 
connect the -strands and serve to recognize 
their respective substrates. Domains fused 
to the C terminus of LRRs are found in many 
other NLRs (3, 16) and are likely to differ in 
structure and function. Therefore, to address 
the specific type of PL domain used by ROQ1 
and RPP1, we refer to it as the C-terminal 
jelly-roll/Ig-like domain (C-JID). 

The mode of recognition used by the C-JID to 
detect the foreign protein is reminiscent of the 
way immunoglobulins bind to their antigen 
(fig. S6). Loops emerging from the B-sandwich 
structure target sites in XopQ to form substrate- 
specific contacts with the pathogen protein in 
a manner that resembles the complementarity- 
determining regions found in antibodies. The 
loop between £-strands 7 and 8 of the ROQ1 
C-JID simultaneously recognizes a hydropho- 
bic pocket through the insertion of an iso- 
leucine (112778°%) and an area of negative 
potential targeted by R1280*°@ (Fig. 2B). Dis- 
rupting these interactions with a ROQ1 double 
mutant (R1280D and 11277A) prevented HR in 
tobacco leaves, suggesting that these inter- 
actions are essential for XopQ detection by 
ROQI (fig. S4). 

The greatest number of contacts between 
the C-JID and XopQ are made by a 33-residue 
loop (amino acids 1163 to 1196) that dives into 
the active-site cleft of the effector and posi- 
tions side chains in close contact with con- 
served sites required for ADPR binding (Fig. 2, 
D and E) (14). We refer to this loop as the NR 
loop for its ability to bind residues in XopQ 
responsible for nucleoside recognition. Two 
a-helical segments of the loop bring together 
large hydrophobic side chains that interface 
with the interior lid region of XopQ (Fig. 2, D 
and E). The conserved XopQ residues targeted 
in this region (W361¥°S, F366*°P2, L345%°P2) 
serve to recognize the base moiety of ADPR 
(14). Active-site residues that would otherwise 
stabilize the a-phosphate of the ligand (Y311*°?2 
and Y398*°P®) are recognized by ROQI V11718O™, 
H1178 ®°%, and H1179"°@ (14). Additionally, 
H1179%°@ interacts with D120*°?°, which is 
involved in the recognition of one of the sugar 
moieties in ADPR (4). In summary, the con- 
served residues in XopQ involved in recogniz- 
ing the base, a-phosphate, and ribose moieties 
in ADPR are targeted by the ROQ1’s NR loop. 
Mutating the NR loop to a short flexible linker 
(-SGGGSGGS-) resulted in the loss of HR, sug- 
gesting that the ROQ] mutant could no longer 
recognize XopQ (fig. S4). Comparison of our 
structure of XopQ, which is in an open state, 
with the closed, ADPR-bound state (PDB: 4P5F) 
shows that the NR loop overlaps with ADPR 
and thus would prevent the ligand from en- 
tering the active-site cleft or interacting with 
XopQ (Fig. 2E). The presence of the NR loop 
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Fig. 2. Structure of the ROQ1 LRR and C-JID (PL domain) binding to XopQ. (A) Surface contacts 
between the N-terminal region of the LRR, shown with a violet ribbon, and XopQ, represented by its Coulombic 
surface potential. (B) Surface contacts made by the loop between B-strands 7 and 8 of the C-JID domain (light 
blue) and XopQ. (C) The elongated LRR between repeats 23 and 24 (violet) interacting with XopQ (salmon). 
(D) Interactions between the NR loop (light blue) and active-site residues of XopQ required for ADPR binding. 
Catalytic Ca** is shown in gold. (E) Left: Structure of XopQ in the open conformation built from our cryo-EM 
density, with the NR loop inserted into the active-site cleft. The position of ADPR (green arrow) from 

the close state of XopQ (PDB: 4P5F) is modeled to show its overlapping position with the NR loop. Right: 
ADPR-bound, closed state of XopQ. The NR loop is modeled to demonstrate the clashes that would 
occur upon XopQ closure. (F) Residue conservation of the C-JID. Regions where too few sequences aligned 
to calculate a reliable conservation score are colored in gray (labeled “Insuff.”). 


may also block XopQ from transitioning to the 
closed state because the NR loop would clash 
with the lid region capping the active site. These 
observations led us to hypothesize that ROQ1 
not only recognizes the pathogen effector but 
may also inhibit its mechanism of ligand bind- 
ing (19). 

The C-JID of ROQ] has a conserved B-sandwich 
core that may be found in the NLRs of other 
members of the nightshade family (Fig. 2F). 
We ran a BLAST search using the sequence 
of the C-JID (amino acids 1129 to 1306) and 
found multiple hits corresponding to resist- 
ance genes in other species of tobacco, as well 
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as in various species of potatoes, peppers, and 
morning glories. The more conserved resi- 
dues are within the strands of the B-sandwich, 
whereas the loop residues pointing toward 
XopQ are more variable (Fig. 2F). The NR 
loop is only found in three other tobacco spe- 
cies, with minor sequence differences (V1171—I, 
Y1195—>F). This pattern of conservation sug- 
gests that the variable loops emerging from 
the C-JID core of related NLRs could serve to 
recognize different pathogen effectors using 
a mechanism similar to that used by ROQI. 
Such a strategy would be akin to that of se- 
quence variations in the complementarity- 
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determining regions of antibodies that en- 
able them to recognize a diversity of epitopes. 
Previous studies suggested that it was diffi- 
cult to identify mutations in XopQ that could 
evade ROQ1] recognition (20). This is consist- 
ent with our results demonstrating that the 
LRR and C-JID of ROQ1 make multiple con- 
tacts with XopQ, suggesting that this gene 
may be durable in the field and difficult for the 
pathogen to evade. In the future, these con- 
tacts could be modified to build synthetic re- 
ceptors targeting various pathogen effectors, 
resulting in new recognitional specificities. 


Oligomerization of ROQI1 


NLRs are generally thought to exist in an in- 
hibited state mediated by either intra- or inter- 
molecular contacts that prevent oligomerization 
between protomers and activation of the im- 
mune response (6, 2/). Structural studies of 
inactive NLRs suggested that these inhibitory 
contacts hold the nucleotide-binding region 
(NBD, HD1, WHD) in a closed state (7, 22, 2.3). 
Upon activation, these interactions must be 
disrupted to transition to the oligomerization- 
prone state, where the WHD is moved away 
from the nucleotide-binding site, thereby dis- 
placing the ADP-specific MHD motif on the 
WHD and allowing adenosine triphosphate 
(ATP) or deoxyadenosine triphosphate (dATP) 
binding (NLRs have been shown to bind both 
ATP and dATP in their active state; in this 
study, we used ATP) (8, 24-27). We expect 
ROQI to be similarly regulated by autoinhibi- 
tory contacts with the LRR based on evidence 
demonstrating that a truncated version of 
ROQI1 missing the LRR and C-JID regions 
spontaneously triggers an immune response 
in the absence of effectors (12). We sought to 
determine whether the C-JID could play a role 
in autoinhibition. Removing the C-JID of ROQ1 
(APL) resulted in loss of HR in planta, sug- 
gesting that the LRR, not the C-JID, is involved 
in making the intramolecular contacts that 
obstruct a conformational switch to the active 
state (fig. S4). 

Four ROQI protomers oligomerize through 
the NB-ARC domains upon substrate recogni- 
tion (12). Our density map reveals the molec- 
ular contacts between the three subdomains 
of the NB-ARC (NBD, HD1, and WHD) in the 
context of the resistosome, as well as the pres- 
ence of ATP at the nucleotide-binding pocket, 
consistent with an activated state of ROQI 
(Fig. 3). The ATP molecule is stabilized at the 
interface between the NBD and HDI, with 
the NBD recognizing the B-phosphate through 
the canonical P loop (K224 and T225). The 
two aspartates (D300 and D301) in the Walker 
B motif are in close proximity to a Mg”* ion 
that is further coordinated by the B- and 
y-phosphate of ATP as well as T225 of the P-loop 
(Fig. 3A) (28). Further recognition of the ligand 
is provided by R329 of the TTR motif (amino 
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Fig. 3. Oligomerization interfaces between 
NB-ARC domains. (A) ATP modeled in the cryo-EM 
density (4.80) near an oligomerization interface, 
showing the side chains of residues involved in 
ATP and Mg** (magenta) binding. (B) Interface 
between two NB-ARC domains of neighboring proto- 
mers. (C) Left: Contacts between the WHD and HDI. 
Right: Contacts between neighboring NBDs. Colors 
are according to the nomenclature in Fig. 1. 


acids 327 to 329) that interacts with the ATP 
y-phosphate and by residues forming a pocket 
around the base moiety (L190, 1193, L356). 
The role of R329 in sensing the ATP molecule 
is highlighted by a loss of HR phenotype when 
it is mutated to an alanine (fig. S4). 

In agreement with published NLR struc- 
tures in the active state (8, 24-27), the WHD 
of activated ROQ1] is rotated away from the 
nucleotide-binding site, thereby displacing the 
MHD motif and exposing the oligomerization 
interface (Fig. 3, B and C). This arrangement 
allows the NBD-HD1 surface of a protomer to 
intercalate with the NBD-WHD surface of its 
neighbor. The major interactions involve an 
HD1-WHD interface and an NBD-NBD inter- 
face (Fig. 3C). HD1 binds to the neighboring 
WHD using a mixture of polar and hydropho- 
bic contacts. Residues in the fourth a-helix 
of HD1 (amino acids 401 to 413) play an 
important role in forming the ROQ1 tetramer 
(Fig. 3C, left). Single point mutations changing 


Martin et al., Science 370, eabd9993 (2020) 


the character of these residues (E399R, V403D, 
and R410A) resulted in loss of HR, suggesting 
that ROQI1 oligomerization was disrupted (fig. 
S4). Similar results were observed when muta- 
ting a charged residue that brings together 
NBD domains (R229D) (fig. S4). In other struc- 
tures of multimeric NLRs (8, 24-27, the con- 
tacts between the NBDs also involve their 
N-terminal linker. The equivalent linker in 
ROQI is poorly resolved in our cryo-EM map 
compared with the surrounding NBD, for 
which we observed well-defined side-chain 
densities, indicating that the linker region 
in ROQI1 is flexible in the tetrameric state. 
Furthermore, the same linker in other NLRs 
provides contacts that are in part responsible 
for properly positioning the NBDs relative to 
each other. In fact, in NLRs that form larger- 
order oligomers, the linker forms an a-helix, 
whereas in smaller complexes such as the 
pentameric ZAR1, the N-terminal linker forms 
a slim structured loop without any secondary 
structure, allowing for the NBD to pack more 
tightly (fig. S7). Similarly, the poorly defined 
structure in the ROQ] linker could explain the 
tight packing between NBDs that results in 
tetramerization instead of higher oligomeric 
states. 

Mechanisms have been proposed for the 
oligomerization of NLRs with differing reli- 
ance on nucleotide binding (27). In the case of 
ZARI, indirect substrate recognition mediated 
by the guard protein RKSI causes a conforma- 
tional change in the NBD and triggers ADP 
release, but the individual ZARI protomers 
are still unable to oligomerize independently 
of ATP (7, 8). By contrast, the direct recogni- 
tion of flagellin by the NLR NAIP5 induces a 
large conformational transition to the active 
state (26, 29) and has been shown to activate 
even when the ATP-binding P-loop motif was 
mutated (30). The structure of the ROQI NB- 
ARC domain closely resembles that of ZARI 
(fig. S8) and shares a 22.2% sequence simi- 
larity. Previous studies have also shown that 
mutation in the P-loop of ROQI prevented oli- 
gomerization (12), suggesting that ATP bind- 
ing is required for assembly. On the basis of 
these observations, we expect ROQI to follow 
a similar oligomerization mechanism to ZARI, 
in which substrate recognition by the LRR and 
C-JID of ROQ1 induces a conformational change 
in the NBD that releases ADP. ATP binding 
would then be required to transition to the 
oligomerization-prone state. 


TIR domain oligomerization and activation 


Tetramerization of the NB-ARC domains brings 
the TIR domains into close proximity (Fig. 4). 
The individual TIR domains interact with each 
other upon resistosome assembly, allowing 
them to become active NADases and trigger 
HR (31). The mechanism for how this asso- 
ciation renders TIRs catalytically active re- 


4. December 2020 


mains poorly understood. Many structural 
studies on TNL have relied on truncated TIR- 
containing proteins that are missing the sub- 
units driving oligomerization (10, 32-36). Here, 
we describe a mechanism for TIR association 
and activation in context of the fully assembled 
ROQI TNL. 

Our initial fourfold symmetric reconstruc- 
tion of ROQI-XopQ could not clearly resolve 
the density corresponding to the TIR domains. 
Further analysis (see the materials and meth- 
ods) revealed that the four TIR domains do 
not assemble in a fourfold symmetric fashion, 
but rather form a twofold symmetric dimer of 
dimers. The change in symmetry at the TIR 
domains highlights the importance for flexi- 
bility in the linker that connects them to the 
NBDs, as discussed previously. After adjusting 
the symmetry for this region and performing 
focused refinement, the TIR domains reached 
an overall resolution of 4.6 A, allowing us to 
visualize secondary structure elements and 
trace the polypeptide backbone (figs. S1 and 
S3). The TIR domains are arranged forming 
two types of interfaces. First, TIR domains 
engage in a head-to-head, symmetric inter- 
action involving alpha helices the aA and aE, 
of each protomer [nomenclature of TIR struc- 
tural motifs as in (37), shown as interaction 
between the same color protomers; Fig, 4, A 
and B]. This interface, previously called the 
AE interface, is also found in many crystal 
structures of isolated plant TIR domains, in- 
cluding RPP1, RUNI, and SNCI (fig. S9) (70, 32). 
Consistent with published studies on these plant 
TNLs, mutating residues in the oA helix of 
ROQ1 (H30A) disrupts HR and highlights the 
functional importance of these contacts (fig. S4). 

In our structure of ROQI-XopQ, TIR do- 
mains engaged in an AE-AE interaction then 
further dimerize head-to-tail, forming what 
is described as the BE interface (38) (shown 
as interactions between the different colored 
protomers; Fig. 4, A and C). In the BE inter- 
action, the so-called BB-loop (residues between 
BB and aB) of one TIR domain plugs beneath 
the loop between aD; and aF; of the adjacent 
one (Fig. 4C). Previous mutational analyses al- 
ready demonstrated a functionally important 
role for the BB-loop in TIR domains (10, 39). 
We further mutated residues in the aD3-aFı 
loop (1151A and G153A) that are in close con- 
tact with the BB-loop and found that they 
independently resulted in loss of the HR 
phenotype (fig. S4). 

Association between plant TIR domains at 
the DE surface (formed in part by the aD3- 
aE, loop) has previously been observed in 
crystallographic studies, but their conforma- 
tions are different from the ones defined by 
our cryo-EM analysis. For example, the TIR 
domains of RPP1, SNC1, and L6 face each 
other head-to-head at their DE surfaces with 
different rotational angles relative to each other 
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Fig. 4. TIR domain interfaces and conformational rearrangement of the BB-loop. (A) Top view of 
ROQ1 displaying the four TIR domains organized as a dimer of dimers (each symmetric dimer shown in 
distinct yellow and orange). The two interfaces are marked with black dotted lines; the AE interface is formed 
between TIR domains shown in the same color. (B) Orthogonal view from (A) of the AE interface. (C) Orthogonal 
view from (A) of the BE interface marking the BB-loop positioned under the aD; to aE; helices. The proposed 
paths of the protein chain linking the TIR domain to the NBD are shown with purple dotted lines. (D) Top: NADase 
active site of a TIR domain for which the BB-loop is not interfacing with the DE surface. Bottom: 
Conformational rearrangement in the BB-loop bound to the DE surface. The side chain of the putative catalytic 
glutamate (E86) is shown in stick representation. (E) Hypothetical mechanism of TIR oligomerization with the 
position of the BB-loop in red. (1) Individual TIR domains are brought in close proximity. (2) TIR domains 
recognize each other at the AE and BE interface. (3) Assembly causes the conformational rearrangement in the 
BB-loop that opens the NADase active site. A Gaussian filter was applied to the map in (A) to (D) (width 1.5 A) 


to reduce noise. 


instead of interacting in a head-to-tail fashion 
(32), perhaps because the TIR domains were 
visualized in isolation and the NB-ARC do- 
main responsible for driving oligomerization 
was truncated. These studies highlighted the 
importance of the DE surface in plant TIR 
domain oligomerization, but the proper inter- 
actions remained unclear because of the var- 
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lability in conformation between structures. 
The BE interaction is found in more distant 
phyla. The crystal structures of TIR domains 
from the human SARMI1 (0) and MAL (40) 
proteins, as well as TRR-2 (unpublished, PDB: 
4W8G and 4W8H) from Hydra magnipapillata 
share a similar BB-loop conformation to that 
found in the activated ROQ] tetramer, in which 
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it fits under the aD3-aE, loop (fig. S10). This 
structural relationship suggests shared mech- 
anistic features for TIR domain assembly and 
activation between animals and plants. In 
fact, the human SARM1 TIR domains simul- 
taneously form AE and BE interfaces in the 
crystal lattice (figs. S9 and S10) (0). 

Our structure now demonstrates that the 
BB-loop takes on two different conformations 
within the activated ROQI tetramer and under- 
goes a conformational switch as TIR domains 
interact at a BE interface. In the protomer in 
which the BB-loop is unbound (the one that is 
contributing to the BE contacts through its DE 
surface), it is seen in an upward position along 
the rim of the NADase active site (Fig. 4C). 
This conformation is the same as that found in 
the TIR crystal structures lacking BE contacts 
(32). In the other protomer, the BB-loop inter- 
facing with the adjacent DE surface has been 
repositioned by a downward motion of ~12 A 
(Fig. 4D, bottom). A highly conserved glycine 
residue (G52®°™) in the BB-loop likely pro- 
vides the flexibility required to undergo this 
conformational switch (fig. S11). Mutating this 
glycine to a proline resulted in loss of HR (fig. 
S4). Similarly, mutating the equivalent glycine 
in SARM1 (G601P) was shown to hinder flex- 
ibility and prevent a transition to the engaged 
state, resulting in a defective BE interface 
with the loop stuck in the upward position 
and a severe decrease in NADase activity 
(fig. S11) (10). 

Repositioning of the BB-loop induced by 
the BE interface opens the NADase active site 
(Fig. 4D). Large, positively charged side chains 
(of K509% R510, and K53"°°™) that would 
otherwise crowd the entrance of the active 
site are moved down with the BB-loop. The 
structure of a SARM1 mutant (G601P), in which 
the BB-loop is trapped in the unengaged state, 
reveals a lysine inserted inside the active-site 
cleft; this indicates that these side chains may 
act to prevent substrate binding (10). Further- 
more, NADase activity increased when equiv- 
alent BB-loop arginines were mutated to 
alanines in the plant RUN1 TIR domain (JO). 
Together, these studies suggest that these 
large, positively charged side chains serve 
to inhibit NADase function and must be dis- 
placed for TIR activation. 

Freeing the active site exposes conserved 
residues that have been proposed to recognize 
NAD” based on biochemical and structural 
studies using chemical analogs (9, 10). The 
nicotinamide moiety of the substrate is sup- 
posed to fit in the active-site cleft of the TIR 
domain, bringing the covalently linked ribose 
in close proximity to the catalytic glutamate. 
Mutating the putative catalytic glutamate in 
ROQI] (E86®°) to an alanine abrogates HR 
(fig. S4:), suggesting loss of NADase activity. 
No NAD’ was observed in our structure, which 
could have been cleaved by the activated TIR 
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domain, yet we still observed a small density 
positioned above the TIR domain active site at 
the BE interface (fig. S12). An ATP molecule 
was modeled at this position in Ma et al. (18). 
The corresponding density in our map is weak 
compared with that of neighboring residues 
(W82), with only part of an ATP molecule 
fitting in the density even at low contour levels 
(fig. S12). Therefore, it was left unmodeled. 
Further investigation will be required to de- 
termine the position of NAD* within the ROQI 
TIR domain active site. Mechanistic details of 
NAD* cleavage and product formation remain 
unresolved and have been found to vary among 
TIR domains (37). The steps in this enzymatic 
reaction involve breaking the glycosidic bond 
that connects nicotinamide to ADPR and, in 
some cases, a structural rearrangement in 
ADPR that leads to the formation of cyclic- 
ADPR or variant-cyclic-ADPR (9, 41). ADPR 
and cyclic-ADPR have been shown to mod- 
ulate the Ca”* level in plant cells, which is a 
widely used chemical signal for responding 
to various biotic and abiotic stresses (42). 

In the case of the fully assembled ROQI 
TNL, it is clear that the AE and BE interfaces 
are essential in TIR signaling. Both interfaces 
align the TIR domains in a conformation in- 
ducive to NADase active-site opening (Fig. 4E). 
Whether this mechanism of TIR association 
can be applied to other TNL remains to be 
determined. Similar to ROQ1, the TIR domains 
of the activated RPP1 tetramer form a dimer 
of dimers through AE- and BE-interface con- 
tacts, causing a rearrangement in the BB-loop 
(18). Nevertheless, there are likely to be alter- 
native ways for TIR domains to assemble based 
on the number of protomers required to build 
the active complex, heterocomplex formation 
with other NLRs, and interface requirements 
for activation. 


Summary 


Our structure of the ROQ1-XopQ complex, to- 
gether with previous biochemical studies, led 
us to propose a mechanism for TNL immune 
signaling: (i) both the LRR and C-JID recog- 
nize the pathogen effector, at which point the 
NR loop inserts itself into the active-site cleft 
of XopQ and targets conserved residues re- 
quired for nucleoside-binding; (ii) the NB-ARC 
domain is released by the LRR and undergoes a 
conformational switch to the ATP-bound oligo- 
merization state; (iii) ROQ1 protomers associ- 
ate into a four-leaf-clover structure and the TIR 
domains are brought in close contact; (iv) the 
TIR domains bind to each other, forming dis- 
tinct AE and BE interfaces and causing a con- 
formational rearrangement in the BB-loop of 
two of the subunits; and (v) the NADase active 
site is exposed, allowing for the cleavage 
of NAD”. Future work will be required to 
identify the exact molecular species produced 
by the TIR domains and how they are used by 


Martin et al., Science 370, eabd9993 (2020) 


the immune system of the host to trigger a 
response to pathogen invasion. 


Methods summary 


ROQ]-3Flag and StrepI-XopQ were transformed 
into Agrobacterium GV3101 and coexpressed 
in N. benthamiana eds1-1 mutant leaves through 
transient agroinfiltration. Leave tissue was har- 
vested 30 hours after infiltration and ground 
with a mortar and pestle. The powdered leaf 
material was then resuspended in a buffer 
supplemented with protease inhibitors and 
further lysed by sonication. Two rounds of 
centrifugation were required to separate cell 
debris from the soluble fraction of the lysate. 
Purification of the assembled Roql-XopQ com- 
plex was performed by sequential affinity chro- 
matography. First, we selected for ROQI-3Flag 
by Flag-immunoprecipitation. Strep-tactin resin 
was then used to capture the StrepII-XopQ 
containing complex. Individual steps can be 
visualized on a SDS-polyacrylamide gel electro- 
phoresis gel in fig. S13. 

The purified Rogl-XopQ complex was de- 
posited on a freshly plasma cleaned holey car- 
bon grid (QUANTIFOIL R2/2) coated with a 
thin layer of carbon. The original buffer used 
in the purification of the sample was removed 
and replaced with a similar buffer contain- 
ing 3% trehalose instead of NP-40 and glyce- 
rol. The grid was mounted onto a Thermo 
Fisher Scientific Mk. IV Vitrobot, blotted, and 
plunged-frozen in liquid ethane. 

Data were collected using a Titan Krios cryo- 
electron microscope operating at 300 kV and 
equipped with a K3 direct electron detector 
camera mounted behind a BioQuantum energy 
filter. A total of 11,134 dose-fractionated movies 
were acquired in superresolution counting 
mode, with an electron exposure of 50 e/A? 
and defocus values ranging from -0.9 to -2.5 um. 

All processing steps were done in Relion 3.1 
(43). CTF fits were calculated in Gctf (44). Par- 
ticles were selected from the motion corrected 
micrographs (45) using an unbiased Laplacian- 
of-Gaussian autopicker and were subjected 
to an initial round of 3D classification. One more 
round of refinement followed by alignment- 
free 3D classification resulted in a class of 
15,263 particles with a broad distribution of 
projection directions that converged to 3.8-A 
resolution [Fourier shell correlation (FSC) = 
0.143]. Symmetry expansion and focused refine- 
ment were required to resolve the LRR-PL-XopQ 
and TIR domain regions at a resolution of 3.8 
and 4.6 A, respectively. A detailed procedure 
is described in figs. S2 and S3. 

Individual models were built for each of the 
three cryo-EM reconstructions. An initial mod- 
el for the NB-ARC domain, TIR domain, and 
XopQ were generated using structures of ho- 
mologous proteins in the PDB (6J5T, 5KU7, 
and 4KLO, respectively). The LRR and PL do- 
main of ROQI1 were built manually in COOT 
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(46). Models were refined and validated in 
Phenix (47). A final model versus map FSC 
was calculated using MTRIAGE (48). Model 
validation statistics can be found in table S1. 

For HR phenotype observation, ROQ1-3Flag 
mutants and StrepII-XopQ were transiently 
coexpressed in NV. tabacum roqI-1 mutant leaves 
through Agrobacterium-mediated transforma- 
tion. HR phenotypes were observed and im- 
aged 2 days after infiltration. To test for protein 
expression, ROQ1-3Flag mutants and StrepII- 
XopQ were coexpressed and extracted from 
N. benthamiana edsI-1 mutant leaves and de- 
tected by Western blotting. 

Conservation scores for residues in the ROQI 
C-JID were calculated in Consurf (49) from a 
multiple sequence alignment provided in the 
supplementary materials (sequence alignment 
file S1). 
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The N-glycome regulates the endothelial-to- 


hematopoietic transition 


Dionna M. Kasper’, Jared Hintzen?*, Yinyu Wu", Joey J. Ghersi’*?, Hanna K. MandI?, 
Kevin E. Salinas’, William Armero”?, Zhiheng He’, Ying Sheng“, Yixuan Xie*, Daniel W. Heindel?, 
Eon Joo Park*®, William C. Sessa*°, Lara K. Mahal?”, Carlito Lebrilla’, 


Karen K. Hirschi’****, Stefania Nicoli”?*°* 


Definitive hematopoietic stem and progenitor cells (HSPCs) arise from the transdifferentiation of 
hemogenic endothelial cells (hemECs). The mechanisms of this endothelial-to-hematopoietic 
transition (EHT) are poorly understood. We show that microRNA-223 (miR-223)-—mediated 
regulation of N-glycan biosynthesis in endothelial cells (ECs) regulates EHT. miR-223 is enriched 
in hemECs and in oligopotent nascent HSPCs. miR-223 restricts the EHT of lymphoid-myeloid 
lineages by suppressing the mannosyltransferase a/g2 and sialyltransferase st3gal2, two enzymes 
involved in protein N-glycosylation. ECs that lack miR-223 showed a decrease of high mannose 
versus sialylated sugars on N-glycoproteins such as the metalloprotease Adam10. EC-specific 
expression of an N-glycan Adam10 mutant or of the N-glycoenzymes phenocopied miR-223 mutant 
defects. Thus, the N-glycome is an intrinsic regulator of EHT, serving as a key determinant of the 


hematopoietic fate. 


n vertebrates, definitive hematopoietic stem 

and progenitor cells (HSPCs) are specified 

during embryogenesis by the transdifferen- 

tiation of endothelial cells (ECs) within the 

aorta-gonad-mesonephros (AGM) region 
of the dorsal aorta (1-3). This endothelial-to- 
hematopoietic transition (EHT) occurs in 
hemogenic ECs (hemECs), a subset of ECs 
that coexpress vascular and hematopoietic 
genes. The precise interplay between multiple 
signaling cascades (1, 2, 4) enables the pro- 
gressive loss of endothelial and concomitant 
increase of hematopoietic gene expression, with 
hemECs giving rise to nascent HSPCs (2, 3). 
HSPCs then delaminate from the vascular wall 
and enter the circulation to colonize secondary 
hematopoietic organs, where they generate 
all blood cells throughout life (5-7). The EC 
and/or hemEC determinants that regulate 
EHT and thus HSPC production is not com- 
pletely understood. 
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miR-223 is expressed in ECs undergoing EHT 
Genetic deletion of microRNA-223 (miR-223^^) 
results in excess nascent HSPCs in the zebrafish 
AGM (8). However, the mechanisms underly- 
ing miR-223 function in EHT are unknown 
(8-11). To examine miR-223 expression dur- 
ing EHT, we generated a zebrafish transgenic 
reporter in which miR-223-expressing ECs 
are labeled with both green fluorescent pro- 
tein (GFP) and mCherry (miR-223:GFP” kdrl: 
mCH") (Fig. 1A, fig. SIA, and materials and 
methods). Endogenous miR-223 expression was 
enriched in miR-223:GFP” kdrl:mCH" cells ver- 
sus miR-223:GFP kdrl:mCH cells (fig. S1B). 
Moreover, reexpression of miR-223 from its pro- 
moter in miR-223:Gal4" cells rescued the emyb* 
HSPC overexpansion in miR-223° (fig. SIC). 
Thus, miR-223:GFP” cells report the endoge- 
nous expression and function of miR-223. 

miR-223:GFP* kdrl:mCH*" cells were dispersed 
in a Salt-and-pepper pattern, mostly within 
the AGM (Fig. 1A and fig. S1, D and E). The 
AGM population of miR-223:GFP* kdrl:mCH* 
cells increased from the onset to the peak of 
EHT, at 24 to 32 hours after fertilization (fig. 
SIE). They manifested a heterogeneous pat- 
tern of flat to bulging morphologies and un- 
derwent delamination from the AGM (Fig. 1A; 
fig. S1, F and G; and movie S1). miR-223:GFP* 
kdrl:mCH™ cells showed elevated expression 
of endothelial (kdrl) and EHT markers, in- 
cluding gata2b and runx!1 in hemECs (72) and 
cmyb in nascent HSPCs (5) (fig. SIH). Accord- 
ingly, nascent HSPCs showed elevated expres- 
sion of endogenous mature miR-223 (fig. SID), 
suggesting that miR-223 is expressed in ECs 
undergoing EHT. 
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To further discern the distinct molecular 
subtypes among miR-223-expressing ECs, we 
performed single-cell RNA-sequencing (ScRNA- 
seq) (fig. S2A and materials and methods). 
Kdrl:mCH* ECs formed a vascular tree com- 
posed of branches that corresponded to em- 
bryonic specification trajectories, identified 
through the expression of known markers 
(fig. S2, A and B, and data file S1). We ob- 
served the arterial trajectory from which cells 
coexpress a continuum of early and late EHT 
markers in hemECs and in nascent HSPCs 
(Fig. 1B, fig. S2B, and data file S1). This 
branch split into two trajectories that in- 
cluded nascent HSPCs expressing early primed 
markers for the lymphoid-myeloid lineages 
or lymphoid-erythroid lineages (Fig. 1C, fig. 
S2B, and data file S1). Gene ontology clas- 
sifications of branch-defining genes confirmed 
that as ECs progressed through the EHT traj- 
ectories, they lost vascular development 
terms while gaining hematopoietic, protein 
biosynthesis, and N-glycosylation terms (13). 
Cell migration and cell cycle transcripts 
were mostly acquired within primed nascent 
HSPCs, when they likely begin to delami- 
nate and/or amplify (fig. S2C and data file 
S2) (1, 2). Thus, our data suggest that oligo- 
potent nascent HSPCs are produced during 
EHT in the AGM. 

Additionally, we found that ECs that express 
miR-223:gfp transcripts comprised ~73 to 
100% of hemEC-nascent HSPC and lymphoid- 
myeloid-primed HSPC branches and only ~17% 
of the lymphoid-erythroid-primed HSPC branch 
(Fig. 1D and fig. S2, D and E). miR-223:gfp" ECs 
had enriched expression of EHT, lymphoid- 
myeloid-, and lymphoid-erythroid- primed 
HSPC lineage markers in the relevant trajec- 
tories (fig. S2F and data file S1). Together, 
these analyses suggest that miR-223 is en- 
riched in hemECs and nascent HSPCs during 
the EHT of oligopotent lymphoid-myeloid- 
primed HSPCs (Fig. 1E). 


miR-223 limits hemEC and lymphoid-myeloid- 
HSPC production 


To test miR-223 function in EHT, we pheno- 
typed miR-223 mutants. Zebrafish miR-223°/% 
compared with wild-type embryos displayed 
an increase in gata2b:GFP” or runx1:GFP* 
hemECs in the AGM (Fig. 2A and fig. $3, A 
and B). As observed previously, cmyb:GFP* 
kdrl:mCH™ nascent HSPCs were increased at 32 
and 36 hours after fertilization in miR-223^^ 
(Fig. 2B) (8). Similarly in mice, miR-223 was 
abundantly expressed in ECs and hemECs from 
the embryonic day 10.5 (E10.5) AGM. Moreover, 
global removal of miR-223 (9) displayed ele- 
vated hemECs in the E10.5 AGM and HSPCs 
at secondary hematopoietic sites (fig. S3, C 
to F). These results suggest that miR-223 is 
a conserved inhibitor of hemEC and HSPC 
production. 
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Fig. 1. miR-223 is expressed in hemECs undergoing EHT. (A) (Top) 
Schematic of miR-223:GFP reporter. (Bottom) Lateral Z-projections of the 
zebrafish AGM. White and yellow arrows point to flat and budding miR-223:GFP™ 
kdrl:mCH* cells, respectively. (B to D) Plots of wild-type kdrl:mCH™ trunk ECs 
2/ hours after fertilization showing expression of (B) EHT, (C) blood lineage, and 
(D) miR-223:gfp genes (n = 6227 cells). Arterial (branch 3) and EHT (branches 8, 


Next, we determined whether nascent HSPCs 
in miR-223 zebrafish have altered behaviors. 
First, miR-223* versus control nascent HSPCs 
had increased proliferative capacity (fig. S4A). 
Second, delamination times from the AGM 
were significantly longer (2.3 + 0.9 hours) for 
miR-223~ as compared with wild-type nas- 
cent HSPCs (fig. S4B and movies S2 and S3). 
Although slowed, cmyb* HSPCs eventually 
delaminated and were still expanded in sec- 
ondary hematopoietic organs of miR-223°/4 
embryos—namely, in the caudal hematopoietic 
tissue (CHT) at 2.5 days after fertilization (8), as 
well as in the thymus and kidney marrow at 
6 days after fertilization (fig. S4C). Thus, the 
excess HSPCs in miR-223“ exhibit aberrant 
proliferation and delamination. 

Last, we examined how miR-223 loss affects 
blood lineage differentiation of the supernu- 
merary HSPCs. Consistent with the relatively 
high expression of mzR-223:gfp in lymphoid- 
myeloid-primed HSPCs in the AGM (Fig. 1D 
and fig. S2E), lymphoid and myeloid progen- 
itors and differentiated cells were significantly 
expanded in secondary hematopoietic organs, 
whereas the erythroid lineage was unchanged 
(Fig. 2C and fig. S4D). Reexpression of wild- 
type miR-223 in miR-223°/4 ECs normalized 
cmyb’ HSPCs in the AGM and the lymphoid 
and myeloid blood cell lineages to control lev- 
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els (Fig. 2, C and D, and fig. S4E). These data 
suggest that miR-223 functions in ECs to re- 
strict the EHT of lymphoid-myeloid—primed 
HSPCs. 


miR-223 regulates N-glycosylation enzymes 
during EHT 


miRNAs typically bind miRNA-responsive ele- 
ments (MREs) within 3’ untranslated regions 
(3'UTRs) to fine-tune mRNA levels through 
decay and/or translational repression (14). There- 
fore, to identify miR-223 targets involved in 
EHT, we focused on transcripts that were up- 
regulated in miR-223°/“ ECs and harbored a 
miR-223 MRE (8). We found that four out of 
the top eight candidate genes encode en- 
zymes that regulate N-glycosylation (fig. $5, A 
to C, and data file S3). 

N-glycosylation begins in the endoplasmic 
reticulum (ER), where a preassembled glycan 
core is attached onto specific asparagines (N) 
of nascent polypeptides. The glycan is pro- 
gressively modified by glycosidases and gly- 
cosyltransferases as the protein traffics from 
the ER to the Golgi (fig. S5C) (5). The rate of 
glycan flux and/or the expression level of N- 
glycan biosynthesis enzymes (N-glycoenzymes) 
produce a diverse N-glycan repertoire, which 
determines protein activity and fundamental 
cell behaviors in reprogramming and onco- 
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11, and 4) trajectories are indicated. (E) During EHT, a flattened EC (red outline) 
gains hemogenic potential (orange outline) and buds as a oligopotent nascent 
HSPC (yellow outline) from the DA wall. Gradients for endothelial and EHT 
markers are colored on the basis of their expression in the indicated cell types. 
The green gradient and text represent miR-223 expression. DA, dorsal aorta: 
hpf, hours post fertilization; PCV, posterior cardinal vein 


genesis (16, 17). However, N-glycosylation has 
not yet been implicated in EHT. 

We investigated three canonical N-glyco- 
enzymes that were derepressed in miR-223°% 
ECs: alg2, an ER-resident mannosyltransfer- 
ase that incorporates mannose moieties in the 
N-glycan core; /man2lb, a mannose-binding 
lectin that promotes N-glycoprotein transport 
from the ER to the Golgi; and st3gal2, a Golgi- 
resident sialyltransferase that terminally mod- 
ifies N-glycoproteins with sialic acid sugars 
(fig. S5C and data file S3). 

To determine whether miR-223 negatively 
regulates these N-glycoenzyme transcripts 
through their 3'UTRs, we constructed sensors 
consisting of mCherry fused to a 3'UTR frag- 
ment of alg2, lman2lb, or st3gal2 that con- 
tains the putative miR-223 MRE (fig. S5D). 
The alg2 and st3gal2 3'UTR sensors showed 
significant mCherry repression in the AGM of 
miR-223^^ versus control embryos. By con- 
trast, the lman2lb 3'UTR sensor was similar 
between wild-type and mutant embryos (fig. 
S5D). These data suggest that miR-223 represses 
alg2 and st3gal2 transcripts through their 3'UTR 
in AGM ECs. 

We next investigated how alg2 or st3gal2 re- 
pression contributes to miR-223-mediated reg- 
ulation of EHT and HSPC production. We found 
that similar to miR-223, alg2 and st3gal2 were 
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expressed in ECs and increased in nascent 
HSPCs at 27 hours after fertilization (fig. S6A). 
Morpholino-induced down-regulation of alg2 
or st3gal2 expression reduced nascent HSPCs 


















in the AGM, without otherwise affecting 
development (fig. S6, B to E). Moreover, par- 
tial down-regulation of alg2 or st3gal2, which 
had no effect on wild-type embryos, rescued 
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Fig. 2. miR-223 is an endothelial inhibitor of EHT and lymphoid-myeloid-oligopotent HSPC production. 


(A and B) (Top) Z-projections or (bottom) mean + SEM numberof gata2b:GFP” kdrl:mCH” hemECs [n = 24 or 


25 embryos, (A)] or cmyb:GFP” kdrl:mCH™ nascent HSPCs [n = 13 to 18, 32 hours after fertilization; n= 34 
to 36, 36 hours after fertilization (8), (B)]. (C and D) (Top) Representative images or (bottom) mean + SEM area 
of (C) T-lymphocyte marker rag] expression (n = 18 to 35 embryos) and Sudan black” neutrophil clusters 


(n = 36 or 37 embryos) and (D) cmyb expression (n = 18 to 21 embryos). The EC miR-223 rescuing construct 


is described in fig. S4E. DA, dorsal aorta; dpf, days post fertilization; hpf, hours post fertilization; PCV, 
posterior cardinal vein. Significance is represented as not significant (n.s.) P > 0.05, *P < 0.05, **P < 0.01, 
***P < 0.01, ****P < 0.0001, unpaired, two-tailed Mann-Whitney U test. 
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HSPC expansion in miR-223*/“ embryos (fig. 


S6F). Congruently, alg2 and st3gal2 gain of 
function in ECs increased nascent HSPCs (fig. 
S7A). Likewise, mutation of the miR-223 MRE 
in the alg2 or st3gal2 3'UTR (fig. S7, B to D) 
also recapitulated miR-223^^ phenotypes: 
hemECs and HSPCs were expanded in the 
AGM, as were HSPCs and differentiated 
lymphoid-myeloid lineage cells within the 
secondary hematopoietic tissues (Fig. 3, A 
and B, and fig. S7, E and F). Together, these 
data suggest that miR-223 posttranscription- 
ally represses alg2 or st3gal2 to regulate EHT 
and HSPC differentiation. 


The N-glycome controls EHT protein function 


To test whether an altered N-glycome in ECs 
might lead to aberrant EHT in miR-223°4, 
we first profiled changes in specific glycan 
subtypes between wild-type and miR-2234 
ECs (Fig. 4A and fig. S8A). We found that high 
mannose (~71%) and sialylated complex/ 
hybrid (C/H) (~18%) were the predominant 
glycans on proteins from wild-type ECs. By 
contrast, sialylated C/H (~97%) and sialofu- 
cosylated C/H (~3%) glycans were on proteins 
from ECs lacking miR-223, which was conco- 
mitant with a remarkable loss of high man- 
nose type N-glycans (Fig. 4, B and C, and data 
file S4). Whole embryos overexpressing alg2 
or st3gal2 in ECs or treated with the sialic acid 
precursor N-acetylmannosamine possessed 
both HSPC expansion and increased sialylated 
glycan subtypes at the expense of high man- 
nose modifications, similar to those of miR- 
223^^ embryos (Fig. 4D; figs. S7A and S8, B 
to D; and data file S4). Thus, our analysis sup- 
ports that the balance between high man- 
nose versus sialic acid-containing N-glycan 
subtypes in ECs is regulated through miR- 
223-mediated repression of alg2 or st3gal2 
N-glycoenzymes. 

We next identified the N-glycoproteins that 
were altered in miR-223^^ compared with 
wild-type embryos (fig. SSE). The N-glyco- 
proteins with reduced high mannose and/ 
or increased sialylated C/H and sialofuco- 
sylated C/H modifications in miR-223^^ have 
known biological functions relevant to EHT 
and HSPC production, and their transcripts 
were expressed in ECs during EHT (Fig. 5A, 
fig. S8F, and data file S4). Of high interest was 
the metalloprotease Adam10a, which is critical 
for the proteolytic release of several cell sur- 
face proteins such as Notch, Vegfr2, tumor 
necrosis factor-a (TNF-a), and other EHT 
growth factor signaling molecules (78, 19). 
We found that the number of nascent HSPCs 
were significantly decreased in the AGM of 
adaml0a guide RNA (gRNA)/Cas9-injected 
embryos, which is consistent with Adam10- 
mediated regulation of several EHT factors 
(Fig. 5, B and C, and fig. S8G). Down-regulation 
of adami0a in miR-223~“ decreased nascent 
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Fig. 3. miR-223 limits EHT and lymphoid- 
myeloid-oligopotent HSPC production 
through repression of alg2 and st3gal2. 
(A) (Top) Arrows indicate cmyb:GFP* 
kdrl:mCh* nascent HSPCs in the zebrafish 
AGM. (Bottom) Mean + SEM of cmyb:GFP” 
kdrl:mCH* cells (n = 28 to 48 embryos). 

(B) (Top) Images or (bottom) mean + SEM 
area of T-lymphocyte marker rag] expression 
(n = 28 to 35) embryos and Sudan black” 
neutrophil clusters (n = 21 or 37 embryos). 
Abbreviations and significance calculations are 
as in Fig. 2. 
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Fig. 4. miR-223 regulates the endothelial glycome. (A) Extraction procedure 
to assess endothelial N-glycan profiles 27 hours after fertilization. Extracted 
N-glycans were analyzed by means of mass spectrometry (MS). (B) MS 
chromatograms for ECs showing relative abundance of N-glycan subtypes. A 


HPSC numbers to wild-type levels, suggesting 
that Adam10a function was enhanced upon 
miR-223 loss (Fig. 5, B and C). We then over- 
expressed in ECs the wild-type adaml0a or a 
N279 mutant version that prevents the high 
mannose attachment at this site (fig. S8, H and 
I) (20). We found that wild-type adam1l0a— 
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and, even further, the adaml0a N279mut— 
exhibited an expansion of nascent HSPCs in 
the zebrafish AGM (Fig. 5, B and C). These 
data suggest that decreased high mannose ter- 
minal modification of Adam10a promotes its 
function and contributes to the EHT pheno- 
type of miR-223°/ embryos. 
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subset of the identified N-glycan structures are shown (monosaccharide legend 
is provided in fig. S8A). (C and D) Relative abundance of N-glycan subtypes 

in (C) ECs 27 hours after fertilization or (D) whole embryos. N-glycan subtypes 
are colored as in (B). EC GOF, endothelial cell gain of function. 


Discussion 

Our study reveals a role for the N-glycan bio- 
synthesis pathway in restricting the produc- 
tion of hemECs, HSPCs, and distinct blood 
lineages. We discovered an intrinsic mecha- 
nism that negatively regulates the transdiffer- 
entiation of ECs into oligopotent HSPCs on 
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Fig. 5. A distinct N-glycan repertoire regulates protein function to 

limit EHT. (A) Log> (fold change) of N-glycan relative abundance for miR-223- 
regulated N-glycopeptides (n = 3 replicates). (B) White arrows indicate nascent 
HSPCs in the zebrafish AGM of embryos injected with a gRNA that diminishes 
adam10a or with EC gain of function (GOF) adam10a constructs. (C) Mean + SEM 
number of cmyb:GFP* kdrl:mCH* HSPCs (n = 13 to 18 embryos). Abbreviations and 


the basis of a “sugar code” comprising spe- 
cific cell-surface glycans, high mannose, and 
sialylated C/H subtypes. This regulation is 
mediated through miR-223-dependent repres- 
sion of N-glycan biosynthesis genes, which 
controls the glycan repertoire to ensure normal 
HSPC production and differentiation (Fig. 5D 
and fig. S8J). 

Loss of miR-223-mediated regulation of N- 
glycoenzymes causes a switch in N-glycans 
and enhances the function of N-glycoproteins. 
For example, Adam10a EC-autonomous activ- 
ity is required for EHT but causes excessive 
HSPC production when lacking high mannose, 
probably through enhanced regulation of Notch, 
TNF-a, and/or other Adam10-dependent signal- 
ing pathways. 

Furthermore, because EHT is a heteroge- 
neous continuum of cell states (13), particular 
hemECs and nascent HSPCs could require miR- 
223 regulation of N-glycoprotein-dependent 
signaling to restrict EHT and balance blood 
production (Fig. 5D and fig. S8J). Congruently, 
N-glycoproteins throughout the EHT proc- 
ess could influence the differentiation of 
oligopotent HSPCs. Our findings lay the foun- 
dation for mechanistic studies of how protein 
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N-glycosylation balances the diverse array of 
hematopoietic regulators during EHT. 

The discovery that EHT is regulated by a 
specific glycan profile provides avenues to 
explore for the regulation of in vivo and ex vivo 
blood stem cell production (27). The wide 
variety of known glycan metabolism modifiers 
and inhibitors could be used to glyco-engineer 
hemECs, optimizing the production of glyco- 
forms that facilitate somatic cell reprogram- 
ming to HSPCs (22). Thus, our findings could 
inform pharmacological strategies to produce 
HSPCs for therapeutic interventions. 
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Shape perception via a high-channel-count 
neuroprosthesis in monkey visual cortex 


Xing Chen’*, Feng Wang", Eduardo Fernandez’, Pieter R. Roelfsema 


1,3,4% 


Blindness affects 40 million people across the world. A neuroprosthesis could one day restore 
functional vision in the blind. We implanted a 1024-channel prosthesis in areas V1 and V4 of the 
visual cortex of monkeys and used electrical stimulation to elicit percepts of dots of light (called 
phosphenes) on hundreds of electrodes, the locations of which matched the receptive fields of the 
stimulated neurons. Activity in area V4 predicted phosphene percepts that were elicited in V1. We 
simultaneously stimulated multiple electrodes to impose visible patterns composed of a number 
of phosphenes. The monkeys immediately recognized them as simple shapes, motions, or letters. 
These results demonstrate the potential of electrical stimulation to restore functional, life-enhancing 


vision in the blind. 


lectrical stimulation of the visual cortex 
has long been proposed as an approach 
to restore vision in blind people, by- 
passing severe retinal degeneration or 
damage to the eye or the optic nerve. 
Experiments in humans (1-10) and animals 
(11-20) have shown that electrical stimulation 
of visual cortex can reliably evoke the percep- 
tion of a dot of light, known as a phosphene. 
The phosphenes elicited in the primary visual 
cortex are typically perceived as white or gray 
dots, although they are occasionally colored 
(2-4, 8), and range in size from a pinpoint to 
about 2 cm in diameter at arm’s length. Sub- 
jects perceive phosphenes at the location of 
the receptive field of the stimulated neurons 
(9, 10, 17, 20). Notably, phosphenes induced in 
blind but previously sighted subjects are com- 
parable to those in normally sighted subjects, 
even after decades without sight (2, 4, 8). 

A typical visual prosthesis system would con- 
sist of a camera that the user wears on a pair of 
glasses and a portable processor that trans- 
forms camera footage into instructions for 
electrical stimulation of the visual cortex (27. 
We hypothesized that simultaneous stimula- 
tion of multiple electrodes would create recog- 
nizable shapes in the user’s perception. The 
generation of artificial percepts would require 
a substantial number of functional electrodes 
in the visual cortex, covering a sufficiently large 
fraction of the visual field. Most previous 
studies have focused on the properties of in- 
dividual phosphenes and the electrical stimu- 
lation parameters that reliably evoke them. 
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One recent study (22) demonstrated shape 
perception by sequentially stimulating a num- 
ber of electrodes positioned on the surface of 
the visual cortex, to trace out a shape. How- 
ever, sequential stimulation limits the amount 
of information that can be transferred per 
unit of time, and the generation of shapes via 
simultaneous electrical stimulation of multi- 
ple electrodes in the visual cortex remains to 
be demonstrated (23, 24). 


An implant with 1024 electrodes in the 
visual cortex 


Most [although not all (3, 8)] previous studies 
in humans used electrodes that were posi- 
tioned on the surface of the brain and required 
delivery of currents in the milliampere range, 
limiting the number of electrodes that could 
safely be stimulated at the same time. Surface 
electrodes activate several millimeters of cor- 
tex and produce large phosphenes, and this 
results in low spatial resolution. Furthermore, 
previous studies have observed interference 
between the percepts elicited by simultaneous 
stimulation on nearby electrodes (2, 3, 11, 22). 
By contrast, microstimulation by intracorti- 
cal electrodes requires currents that are two 
orders of magnitude lower (3, 8, 15, 16, 19), 
activates neurons located within a few hun- 
dred micrometers of the electrode tip, and po- 
tentially yields higher-resolution phosphene 
percepts (25) (Supplementary text). 

To investigate the generation of artificial 
visual percepts using a large number of intra- 
cortical electrodes, we created a 1024-channel 
implant consisting of a titanium pedestal con- 
nected to 16 Utah arrays (26), each with eight 
rows and columns of 1.5-mm-long shanks (fig. 
SIA). We tested the system in two macaque 
monkeys, L and A. In both monkeys, 14 arrays 
were tiled across the left primary visual cortex 
(area V1) (Fig. 1, A and B). Two arrays were 
placed in V4 (fig. S2), allowing us to monitor 
the effects of V1 stimulation on neuronal ac- 
tivity in a higher cortical area. 
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We presented moving bars (27) and ob- 
tained multiunit receptive fields (RFs) on 820 
of 896 and 616 of 896 V1 electrodes and on 
120 of 128 and 92 of 128 V4 electrodes in 
monkeys L and A, respectively (movies SI 
and S2). In accordance with known retino- 
topy (28), the RFs covered the foveal and 
parafoveal regions of the lower right visual 
field, with V1 RF eccentricities of <9° and <5° 
(Fig. 1, C and D) and V4 RF eccentricities of 
<5° and <15° in monkeys L and A, respective- 
ly (fig. S2). 


Saccade-to-phosphene task 


We used a saccade-to-phosphene task (Fig. 2A 
and movie S3) to investigate phosphene loca- 
tions and determine current thresholds for 
phosphene perception CN = 184 electrodes in 
monkey L, and N = 164 in monkey A). The 
monkeys had previously been trained to make 
eye movements to visually presented dots on a 
computer monitor, and we replaced the visual 
stimuli by electrical microstimulation on indi- 
vidual V1 electrodes. Figure 2B shows the 
probability of saccade execution as a function 
of current amplitude during stimulation of an 
example V1 electrode in monkey L. The cur- 
rent threshold, Cs), defined as the level at 
which the monkey reported the phosphene 
50% of the time, was 12.6 uA. Across elec- 
trodes, the median threshold in monkey L was 
23 uA [interquartile range (IQR), 6 to 40 uA] 
and in monkey A it was 50 uA (IQR, 43 to 58 uA) 
(Fig. 20). 

We recorded neuronal activity in V4 during 
V1 microstimulation (Fig. 2D) and removed 
the electrical artifacts caused by stimulation 
from the V4 signal (28). Figure 2E shows the 
mean response across 17 V4 electrodes, during 
the same session as that in Fig. 2B. The V4 
activity increased as a function of V1 current 
amplitude, with a particularly steep relation- 
ship around the Cs, of the electrode (Fig. 2F). 
We calculated the neurometric threshold, Ro, 
as the current amplitude at which V4 activity 
reached 50% of its maximum. The Rs 9 was 
12.4 uA, similar to the Cso of 12.6 uA. Across 
the currents tested on this V1 electrode, the 
correlation between the V4 activity level and 
hit rate was significant [correlation coefficient 
(r) = 0.94, p < 0.001 (Pearson’s correlation)]. 

It would be advantageous if V4 activity 
could be used more generally to estimate V1 
current thresholds, because prospective pros- 
thesis users might experience fatigue during 
the determination of thresholds on hundreds 
of electrodes based on visibility reports. The 
Rso could be measured while the user is en- 
gaged in an unrelated activity. We observed a 
high correlation between the Rso and Cso in 
both animals [monkey L: r = 0.98, N = 169, 
p < 0.001, 104 unique V1 electrodes for which 
stimulation artifacts were removed successfully; 
monkey A: r = 0.96, N = 64, p < 0.001, 44 V1 
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electrodes (Pearson’s correlation)] (Fig. 2G). 
To further test how well V4 activity relates 
to phosphene visibility, we divided the V1 
electrodes into three equally large groups 
based on the Cs) and determined the average 
V4 response as a function of V1 current am- 
plitude within each group (Fig. 2H). The shape 
of V4 response as a function of current ampli- 
tude resembled the psychometric function 
within each group, confirming that V4 activity 
can be used to estimate V1 current thresholds. 

Although we used a large target window 
(see methods), the monkeys’ eye movements 
were generally directed toward the RFs of 
the stimulated neurons (/4) (compare Fig. 1, 
E and F to C and D; movie S3). To quantify 
this relationship, we computed the correla- 
tion between the polar angle of the mean 
saccadic end point and that of the RF (Fig. 2, 
I and J), which was significant in both mon- 
keys (monkey L: r = 0.77, p < 0.001, N = 184; 
monkey A: 7 = 0.67, p < 0.001, N = 164). The 
correlation between the eccentricity of the 
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saccade end point and that of the RF was also 
significant [monkey L: r = 0.90, p < 0.001; 
monkey A, r = 0.72, p < 0.001 (Pearson’s corre- 
lation)], although there was a consistent under- 
shoot of the saccade relative to the RF [saccade 
amplitude as a fraction of RF eccentricity in 
monkey L: 0.77 + 0.18 (mean + SD), N = 184; 
monkey A: 0.71 + 0.21, N = 164], which was 
more pronounced for weaker stimulation cur- 
rents (fig. S3). These findings confirm that 
phosphenes are perceived at the RF of the 
stimulated neurons (9, 10, 12, 14, 17, 20). The 
next experiments addressed whether multi- 
electrode stimulation elicits interpretable arti- 
ficial percepts. 


Generation of composite visual percepts 


First, we tested whether the monkeys could 
report the spatial alignment of two phosphenes 
(vertical or horizontal) by requiring them to 
saccade to a target above or below the fixa- 
tion spot, respectively (Fig. 3A and movie S4). 
Before array implantation, the animals were 





Fig. 1. Location of electrodes and receptive fields. (A) Locations of arrays in areas V1 and V4 of the visual 
cortex in the left hemisphere of monkey L. (B) Higher-magnification view of array locations. Array numbers 
correspond to the order of attachment between the arrays and the pedestal. (C and D) RF centers. Colors 
correspond to those of the arrays in (A) and (B). (E and F) Average saccadic end points. Colors correspond 


to those of the arrays in (A) and (B). 
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trained extensively on a visual version of the 
task, in which they reported the relative posi- 
tion of two small dots on a screen. In every 
session of the microstimulation task, we se- 
lected unfamiliar pairs of electrodes for simul- 
taneous stimulation, such that the RFs of the 
stimulated neurons were vertically or horizon- 
tally aligned. 

Figure 3B shows the results for an example 
session, with mean accuracies of 91 and 88% 
for the vertical and horizontal conditions, re- 
spectively. Performance was above chance level 
from the start of the session even though the 
electrode set was novel, which suggests that the 
monkey generalized its understanding from 
the visual to the phosphene version of the task. 
The average accuracy across sessions with novel 
electrode pairs was significantly above chance 
[monkey L: 70 + 13%, N = 19 electrode sets, tig = 
6.6, p < 0.001; monkey A: 64 + 18%, N = 11, to = 
2.6, p = 0.026 (one-sample t test)] (Fig. 30). We 
also ran a visual version of the task in which 
we presented dots on the screen at the RF lo- 
cations. Monkey L reached an accuracy of 83 + 
10% (tg = 15, p < 0.001), and monkey A 79 + 
15% (to = 6.2, p < 0.001). These visually pre- 
sented dot configurations were not always 
novel, that is, the monkeys had seen some of 
these dot pairs in previous sessions. 

We rewarded the monkeys after correct sac- 
cades; hence, they might have learned to asso- 
ciate arbitrary phosphene percepts with specific 
targets by trial and error. We therefore com- 
puted the mean accuracy across electrode sets 
during the first 30 trials of the microstimula- 
tion task (Fig. 3D). The animals’ accuracies re- 
mained relatively stable across time. For both 
monkeys, we determined the critical trial, c, 
for which the accuracy was higher than chance 
(p < 0.05, binomial test) when trials 1 through 
c were included, across electrode sets (28). In 
monkey L, c was the first trial [responses were 
correct on 17 of 19 electrode sets; p < 0.001 
(binomial test)], and in monkey A, c was the 
fourth trial (30 of 44 electrode sets and trials; 
p = 0.011). Furthermore, no significant improve- 
ment in performance occurred between trials 
1 to 10 and trials 21 to 30 [monkey L: tig = 0.0, 
p = 1.0; monkey A: tio = 0.17, p = 0.87 (paired 
t test)]. We conclude that the monkeys gen- 
eralized from their experience on the visual 
task to judge whether novel phosphene pairs 
were horizontally or vertically aligned. 


Direction-of-motion task 


The monkeys had also been trained to report 
the apparent motion direction of a sequence of 
visually presented dots. We investigated wheth- 
er the monkeys could interpret the direction of 
successively generated phosphenes (Fig. 3E and 
movie S5). We selected three electrodes with 
a vertical or horizontal offset and stimulated 
them sequentially such that their RFs formed 
a sequence, going from top to bottom or bottom 
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Fig. 2. Saccade-to-phosphene task. (A) Illustration of the task. The yellow 
circle represents a phosphene; the arrow represents an eye movement to the 
artificially induced perception. (B) Probability of eliciting a saccade, as a function 
of current amplitude, during stimulation on an example V1 electrode in monkey L. 
The dashed line indicates the current threshold; error bars represent SEM. 

(C) Cumulative distribution of current thresholds. (D) Schematic showing recording 
from V4 during V1 microstimulation. (E) Time course of activity (relative to 
stimulation onset), averaged across V4 channels during V1 microstimulation 
[same session as in (B)]. The dashed lines indicate the time window during which 
V4 activity was measured. (F) V4 activity as a function of stimulation current 
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[same session as in (B) and (E)]. The dashed line indicates the current threshold; 
error bars represent SEM. (G) Correlation between current thresholds derived 
from neurometric (R59) and psychometric functions (Cso). (H) Mean performance 
(top) and mean V4 response (bottom) as a function of current amplitude for 

V1 electrodes with low (blue), intermediate (red), and high thresholds (yellow; 
tertiles). Vertical error bars represent SEM of performance levels (upper) or V4 
responses (lower); horizontal error bars represent SEM of current levels. (I and J) 
Comparison of polar angle (I) and eccentricity (J) between RFs and saccadic 

end points. The blue line shows a linear regression, which was constrained to pass 
through the origin. 
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A 


Fig. 3. Two-phosphene orientation 
and direction of motion task. 

(A) Two-phosphene orientation 
discrimination task. The monkey 
reported with an eye movement 
whether two phosphenes (or visually 
presented dots) were oriented 
horizontally or vertically. (B) (Left) 
RF centers for an example set of 
four electrodes in monkey A. Black 
circle, fixation point; orange dots, 
horizontal condition; purple dots, 
vertical condition. (Right) Accuracy 
(averaged across five trials) for 

this example set. (C) Distribution 

of accuracies across electrode sets 
during microstimulation (green) and 
visual (blue) versions of the task. 
(D) Mean accuracy across electrode 
sets and monkeys, as a function of 
trial number, on the microstimulation 
version of the task (averages and 
SEM across five trials). (E) Direction- 
of-motion task. (F) (Left) RF centers 
for an example set of three elec- 
trodes, stimulated in sequence 
(indicated by arrows). (Right) 
Accuracy for this example set. 

(G and H) Similar to (C) and (D), 
for the direction-of-motion task. 


| Microstim. | 


Visual 


to top for “vertical” sets of electrodes and left 
to right or right to left for “horizontal” sets of 
electrodes. Figure 3F illustrates an example 
session in monkey A, in which the RFs had a 
vertical offset and the monkey reached high 
accuracy from the first trial onward. Across 
all electrode sets of the microstimulation task 
(Fig. 3G), the average accuracies of monkeys L 
and A were 76 + 17% [N = 13, ts = 5.5, p < 0.001 
(¢ test)] and 69 + 16% (N = 19, tg = 5.2, p < 
0.001), respectively. The monkeys’ accuracies 
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for novel electrode combinations were higher 
than chance from the first trials onward. The 
critical trial c was trial 1 in monkey L (correct 
on the first trial in 11 of 13 sessions; p = 0.011) 
and trial 4 in monkey A (trials 1 to 4, 48 of 76 
correct; p = 0.014) (Fig. 3H), indicating that 
they were able to generalize their understand- 
ing of apparent motion from the visual to the 
phosphene task. No significant improvement 
in performance occurred between trials 1 to 10 
and trials 21 to 30 [monkey L: ty» = 1.35, p = 
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0.2; monkey A: tig = —-1.85, p = 0.08 (paired 
t test)]. The accuracies of monkeys L and A in 
the task with a sequence of visually presented 
dots were 74 + 11% (fy = 8.2, p < 0.001) and 
89 + 11% (tig = 15.6, p < 0.001), respectively, 
although these dot combinations were not al- 
ways novel. 


Letter task 


Next, we examined the possibility of creating 
more complex shape percepts. We delivered 
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stimulation on 8, 10, or 15 electrodes in V1 
simultaneously, selecting the electrode sets 
such that the RFs collectively formed the shape 
of one of two letters. The number of stim- 
ulated electrodes was always the same be- 
tween the two conditions in a given session. 
Before electrode implantation, the monkeys 
had learned to report the identity of 16 visu- 
ally presented letters on a screen by making 
an eye movement to a target. The training 
program also included letters presented as 
sparse dot patterns, designed to simulate phos- 
phene vision (fig. S4) (29). After array im- 
plantation, the training continued, and we 
initially alternated visual and microstimula- 
tion blocks. Thereafter, sessions consisted 
solely of visual or microstimulation trials. 
During visual trials, one of two letters was 
presented as a dot pattern on the screen. 
During microstimulation trials, we delivered 
simultaneous electrical stimulation to a nov- 
el set of electrodes for 167 ms (50 pulses at 
300 Hz) (Fig. 4A and movie S6). 

Figure 4B shows the accuracy of monkey A 
for an example set of electrodes on the micro- 
stimulation version of the task. In one condi- 
tion, the RFs of the stimulated electrodes 
formed the letter T (orange dots) and in the 
other condition they formed the letter L (pur- 
ple dots). In this example session, accuracy 
was 0.89 and 0.96 for the T and L conditions, 
respectively, and it was relatively high from 
the first trial onward. Across all electrode sets 
(Fig. 4C), the average accuracy of monkey L 
was 81 + 18% LN = 9, tg = 5.4, p < 0.001 (¢ test) ] 
and the accuracy of monkey A was 71 + 18% 
(N = 10, tg = 3.6, p = 0.006) on the microstim- 
ulation version of the task. Accuracy on the 
visual version of the task was 88 + 5% for 
monkey L (tg = 23.1, p < 0.001) and 93 + 4% 
for monkey A (tg = 36.7, p < 0.001), although 
some of these dot displays were not novel. 

To control for the possibility that the mon- 
keys learned to map arbitrary percepts onto 
saccade targets, we examined the accuracy 
time course (Fig. 4D). The critical trial c (ac- 
curacy above chance level, binomial test) was 
trial 3 in monkey L (20 of 27 correct, p = 
0.0096), and trial 7 in monkey A (43 of 70 cor- 
rect, p = 0.036). No significant improvement 
in performance occurred between trials 1 to 10 
and trials 21 to 30 [monkey L: tg = —0.7, p = 
0.5; monkey A: fy = 0.7, p = 0.5 (paired ¢ test)]. 

As a further control, we examined how long 
it took the monkeys to associate arbitrary 
microstimulation patterns with eye movements. 
We selected novel, nonoverlapping combina- 
tions of electrodes, such that their RFs did not 
form any recognizable letter (N = 10 electrode 
sets in both monkeys). The mean accuracy of 
monkeys L and A during the first 100 trials 
was close to chance level (48.2 and 46.9%; p > 
0.2 in both monkeys) (gray data points in Fig. 
4D) and lower than during the first 100 trials 
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of the microstimulation task with familiar let- 
ters (monkey L, ty = 5.7, p < 0.001; monkey A, 
tig = 2.4, p = 0.03). We also ran a visual version 
of the control task, in which the monkeys saw 
two novel visual dot stimuli. The mean accu- 
racies of monkeys L and A were 50.5 and 
49.9%, respectively (p > 0.9), indicating that 
they also failed to rapidly learn to map un- 
familiar dot stimuli onto saccade targets (Fig. 
4D and fig. S5). 

These results indicate that the monkeys’ 
abilities to perform the letter task were not 
due to rapid learning of new associations be- 
tween stimuli and eye movement targets. 
Rather, the phosphene percepts must have 
borne a close resemblance to the visually 
presented letters, allowing the monkeys to 
benefit from their extensive prior experience. 


Discussion 


We demonstrated that the simultaneous stim- 
ulation of multiple electrodes in V1 gives rise 
to the perception of shape and that successive 
stimulation gives rise to the perception of mo- 
tion, providing proof of concept for the use of 
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electrical microstimulation to create a form of 
artificial vision in the blind. Previous studies 
suggested that the successful generation of 
phosphene percepts can be read out in higher 
visual cortical areas (30, 31). Here, we dem- 
onstrated how V4 activity can be used to 
determine phosphene thresholds in V1, circum- 
venting a time-consuming calibration pro- 
cess in which the prosthesis user has to report 
whether currents are above or below the per- 
ceptual threshold on hundreds of electrodes. 

These results capitalized on several develop- 
ments. First, we implanted a large number of 
electrodes with foveal and parafoveal cover- 
age of one quadrant of the visual field. Simu- 
lations of phosphene vision estimated that 
whereas letter recognition is possible with a 
few hundred electrodes, fluent reading and 
object recognition in crowded scenes may re- 
quire thousands of electrodes (32, 33). By im- 
planting more than 1000 electrodes across a 
large area of V1, we could generate numer- 
ous phosphenes across a contiguous region of 
the visual field (Fig. 1) and elicit recognizable 
shapes (Figs. 3 and 4). 
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Fig. 4. Letter recognition task. (A) Illustration of the task. The monkey reported letter identity with an 
eye movement. (B) (Top) RF centers of eight stimulated V1 electrodes whose RF centers formed a T or an 
L (orange and purple circles). (Bottom) Accuracy (smoothed across five trials) for this example set. 

(C) Accuracy for sessions on microstimulation (green) and visual (blue) versions of the task. (D) Mean 
accuracy across electrode sets and monkeys, as a function of trial number (average and SEM across five 
trials) in the microstimulation (green) and visual (blue) versions of the task. In a control task with novel 
electrical stimulation and visually presented dot patterns, the accuracy was close to chance (gray). 
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Secondly, we used intracortical electrodes, 
which have advantages over the subdural 
electrodes used in previous human studies 
(2, 10, 34). Subdural electrodes require cur- 
rents in the milliampere range and activate 
populations of neurons across millimeters of 
cortex (9, 10), which limits their resolution. 
Furthermore, the stimulation of adjacent sub- 
dural electrodes may cause interference, gen- 
erating a single, large phosphene instead of 
several small ones. Using depth electrodes, 
we elicited phosphenes with median stimu- 
lation currents of 23 to 50 uA, which activate 
small populations of neurons located within 
a few hundred micrometers of the electrode 
tip (25). Hence, intracortical electrodes in- 
duce smaller and more precise phosphene 
percepts (3, 8), but a systematic comparison 
of phosphenes produced by subdural and in- 
tracortical stimulation has yet to be performed 
(supplementary text) (24). 

Third, before implantation the monkeys 
underwent intensive training on visual versions 
of the tasks, in which they saw shapes com- 
posed of small dots, designed to simulate phos- 
phene percepts (29). As a result, the monkeys 
were immediately able to correctly interpret 
shape and motion stimuli composed of mul- 
tiple phosphenes. Nevertheless, the monkeys 
made mistakes and even performed close to 
chance for several electrode sets. The accuracy 
in the tasks with visually presented dot pat- 
terns was also not perfect, indicating that the 
monkeys did not always recognize them, al- 
though this may have been caused by occa- 
sional drops in motivation (e.g., at the end of 
a recording session). We expected the accu- 
racy for the tasks with electrical stimulation 
to be worse, because it is not possible to con- 
trol the appearance of individual phosphenes. 
Humans report that phosphenes induced by 
stimulation of early visual cortical areas vary 
in color, brightness, shape, and perceived dis- 
tance (3, 6, 8, 35). Furthermore, simultaneous 
stimulation on two electrodes that are a few 
hundred micrometers apart can yield fused 
phosphenes (3). Such variability and potential 
interactions may make electrically induced 
percepts more difficult to interpret than visu- 
ally presented ones. 

A number of technical hurdles still have to 
be overcome before a visual cortical prosthe- 
sis is available for humans. The mean surface 
area of human primary visual cortex is 25 to 


Chen et al., Science 3'70, 1191-1196 (2020) 


30 cm? per hemisphere, and future implants 
should cover a large enough region of the 
visual field with a sufficient density of phos- 
phenes to generate interpretable percepts 
(36, 37). Furthermore, it will be necessary to 
create wireless technology with high channel 
counts and to develop long-lasting, biocom- 
patible electrodes that minimize the risk of 
gliosis, tissue trauma, and encapsulation (38). 
Much progress is being made in the develop- 
ment of brain-computer interfaces for sen- 
sory restoration and motor prostheses (39-41). 
Combined with the present demonstration 
of artificial vision, these developments place 
a light at the end of the tunnel for those with- 
out sight. 
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The exploitative segregation of plant roots 


Ciro Cabal", Ricardo Martinez-Garcia*", Aurora de Castro Aguilar’, 


Fernando Valladares**, Stephen W. Pacala’ 


Plant roots determine carbon uptake, survivorship, and agricultural yield and represent a large 
proportion of the world’s vegetation carbon pool. Study of belowground competition, unlike aboveground 
shoot competition, is hampered by our inability to observe roots. We developed a consumer-resource 
model based in game theory that predicts the root density spatial distribution of individual plants 

and tested the model predictions in a greenhouse experiment. Plants in the experiment reacted to neighbors 
as predicted by the model's evolutionary stable equilibrium, by both overinvesting in nearby roots and 
reducing their root foraging range. We thereby provide a theoretical foundation for belowground allocation 
of carbon by vegetation that reconciles seemingly contradictory experimental results such as root 
segregation and the tragedy of the commons in plant roots. 


root system’s ability to obtain soil re- 
sources essential for growth and sur- 
vival determines a plant’s yield and is 
highly influenced by competition with 
other plants’ roots (D). However, we have 
few observations of intact root systems in soil 
and lack a comprehensive theory for root sys- 
tem responses to their environment (2). Agri- 
cultural technologists need to understand the 
mechanisms by which plants compete below- 
ground to design and breed ideal cultivars 
(3). Understanding the response of these cul- 
tivars to different plant densities is crucial to 


Fig. 1. Spatial distribution across one 
horizontal dimension of root density 

in solitary plants and pairs of interacting 
individuals separated by a distance 

d =10 cm. The y axis measures root density, 
not depth. (A to C) Results from the 
mathematical model solved under (A) Pareto 
conditions and (B) an ESS equilibrium, and 
(C) empirical results. Orange lines (and circles) 
correspond to a focal plant that is interacting 
with a neighbor (yellow), and blue lines (and 
circles) correspond to solitary plant, both 
centered at the zero spatial coordinate. 
Areas shaded with vertical bars indicate the 
differences between the root systems of the 
solitary and the focal interacting plants at 
each distance from their insertion to the 
substrate, orange bars indicate local root 
overproduction, and blue bars indicate local 
root underproliferation of the focal interacting 
plant. The black cross indicates the position 
of the shifting point (S). In (c), small circles 
represent the data, large circles represent 
the mean root density at a given spatial 
coordinate, and lines represent linear 
interpolations between the means. See 
materials and methods (supplementary 
material) for details. 
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optimize crop resource-use efficiency and max- 
imize food production (4). Additionally, plant 
roots represent a large carbon pool that stores 
approximately a third of the world’s phyto- 
mass (5), and understanding how roots re- 
spond to belowground competition is critical 
to predict vegetation carbon uptake and to 
design strategies to mitigate climate change 
(6). Here, we develop a game-theoretic model 
that predicts how one plant’s roots will re- 
spond to roots of nearby plants. We also pres- 
ent a dataset describing spatial distributions 
of root density for individual plants, and how 
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these respond to competition, confirming mod- 
el predictions. 

Plants can sense soil resources (7) and the 
presence of nonself (8) and nonkin roots (9) 
and respond phenotypically to these stimuli 
(10). However, published data have been con- 
tradictory about the direction of the below- 
ground plastic responses to the presence of 
competitors and unclear about the mecha- 
nisms. Studies of changes in vertical or hori- 
zontal root foraging ranges often conclude 
that neighboring plants tend to minimize the 
overlap between their root systems (root seg- 
regation) (77). A second group of studies ex- 
amines the effect of competition on the total 
carbon allocated to roots (12). Among these, 
some report a decrease in plants’ allocation of 
carbon to roots (13, 14), presumably in accord- 
ance with root segregation; others report an 
increase (15-17); and still others find no sig- 
nificant effect (18, 19). Neither of these ap- 
proaches is complete. The former considers 
only the foraging range but neglects the dis- 
tribution of root density within that range, 
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and the latter ignores the spatial distribu- 
tion of roots. Few have considered both fea- 
tures of root systems together [but see (20)]. 
Theoretical studies have added to the con- 
fusion in the literature. Some modeling studies 
rely on simple optimization, in which roots are 
assumed to equalize average nutrient uptake 
per unit rooting effort across space (27). Others 
incorporate game theory (22) and predict 
that plants will engage in a belowground 
tragedy of the commons (23-25) sensu Hardin 
(26) and should thus overallocate to roots in 
the presence of a neighbor. Game-theoretical 
approaches, like the empirical ones, have gen- 
erally not looked simultaneously at total allo- 
cation and spatial distribution [but see (27) ]. 
We developed a spatially explicit game- 
theoretical framework describing plant com- 
petition for a soil resource (supplementary 
materials). Our model explicitly incorporates 
soil resource dynamics and a distance-dependent 
foraging cost. The fitness cost of accessing a 
soil patch at a distance from the stem accounts 
for the cost of proliferating and maintaining 
absorbing roots inside the patch and trans- 
porting roots from the patch to the stem. 
Game-theoretic equilibrium solutions can be 
obtained either by assuming that plants adopt 
the strategy that produces the highest possible 
fitness for the collective (cooperative equilib- 
rium) or by maximizing their competitive 
ability [evolutionary stable strategy (ESS)] 
(28). Cooperative fitness maximization implies 
that plants should collectively maximize re- 
source uptake per unit of root cost (a fitness 
surrogate). To do so, they would need either 
to directly recognize nonself roots and their 
cost to the neighbor or to have been artificially 
selected to deploy roots that maximize collec- 
tive production given a fixed spatial pattern of 
individuals. This strategy represents a poten- 
tial target for crop optimization. In the ESS, 
each individual plant maximizes its own up- 
take per unit cost. This is the plant strategy 
that presumably evolves by natural selection. 
To analyze the phenotypical response of 
plants to competition, we compared root spa- 
tial distributions predicted by a model of a 
solitary plant (bell-shaped root distribution 
centered at the insertion to the substrate) 
and a plant facing a neighbor. When plants 
in competition were assumed to follow a 
cooperative strategy (Fig. 1A), root behavior 
showed cooperative segregation; that is, each 
plant withdrew from any location where it 
sensed nonself roots having lower exploitation 
costs. However, when plants were assumed to 
embrace an ESS, the model predicted exploit- 
ative segregation; that is, the range of the root 
system was shortened on the side facing a 
neighbor, but plants locally overproliferated 
roots close to their own insertion point (Fig. 
1B; see supplementary materials for repre- 
sentations in three dimensional soil). The 
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Fig. 2. ESS model prediction of a plant’s total root biomass (integral of the spatial distribution of 
root density). The blue line indicates a solitary plant andthe orange line indicates a focal interacting plant, at 
increasing distance from an interacting neighbor (d, centimeters). The orange-hatched area indicates an 
increase and blue-hatched area a decrease in total plant root biomass as a response to the presence of the 
neighbor at each distance. (Insets) The root density distribution in space of plants at d = O cm (maximum 
total biomass increase) and at d = 30 cm (total biomass decreases). Insets’ color codes, abscissa axis 
values, and model parameterization are the same as in Fig. 1. See materials and methods (supplementary 


material) for details. 


shifting point S defines the distance from 
the plant-shoot insertion to the substrate at 
which the strategy shifts from root overpro- 
liferation to underproliferation. Both the over- 
and underproliferation predicted by the ESS 
simply optimize a plants resource economy 
when in competition and do not require direct 
sensing of nonself roots. 

For a solitary plant, the benefit of proliferat- 
ing roots in a given location decelerates as 
local root density increases, because, even with- 
in a single plant’s root network, individual 
root branches compete with one another for 
the same resource. As roots proliferate, the so- 
litary plant gains more total resource but also 
steals more resource from itself. When a neigh- 
bor is present, two opposing forces are at 
work: (i) Some of the stolen resource comes 
from the neighbor, promoting overprolifera- 
tion. (ii) Increased proliferation of a neighbor’s 
roots depletes soil resource, promoting under- 
proliferation. The cost of growing a unit of 
absorbing root surface increases with distance 
from the plant because the root surface must 
be connected to the stem by a xylem conduit. 
When the local cost of roots is low, force (i) 
dominates and plants should overproliferate 
roots. However, in locations far from a focal 
plant but close to its neighbor, force (ii) dom- 
inates, leading to local underproliferation by 
the focal plant. 
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Exploitative segregation can explain seem- 
ingly contradictory results reported in the lite- 
rature. When two competing plants are very 
close to each other, plants should respond to 
competition by increasing biomass allocation 
to roots (Fig. 2) and should exhibit a root trag- 
edy of the commons. However, as plants are 
set farther apart, local underproliferation in 
distant areas compensates for local overprolif- 
eration near the plant, resulting in a slight de- 
crease in total root allocation. Also, as in the 
literature, such plants should exhibit range 
segregation by decreasing the total spatial 
spread of their root systems. 

We tested these predictions in a greenhouse 
experiment using a traditional cultivar variety 
of pepper plant (Capsicum annuum L. var. 
luesia, Solanaceae) (supplementary materials). 
Plants were grown in gutters alone or paired 
with a competitor located at 10-cm distance. 
Root systems of competing individuals were 
stained in situ with different colors, and the 
spatial distribution of root density of each 
plant was mapped in the horizontal axis of the 
gutter. We thereby obtained data integrating 
spatial information and plant allocation strategy. 

We did not observe a root tragedy of the 
commons at that given plant-plant distance, as 
there were no significant differences in total 
root biomass (t = —0.9422, P = 0.3573), shoot: 
root ratio (U = 116, P = 0.9025), or reproductive 
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yield (¢ = —0.6177, P = 0.5417) between solitary 
and competing plants. We detected evidence 
for root segregation, as we found significant 
differences (¢ = —5.3362, P < 0.0001) between 
the observed root overlap in interacting plants 
(20 cm of overlap) and the expected overlap 
between solitary-like distributed root systems 
(40 cm). Plants exhibited an exploitative seg- 
regation response to competitors (Fig. Ic) as 
predicted by the ESS equilibrium of our mod- 
el. Each plant overproliferated roots near to 
its own stem and underproliferated roots 
closer to its neighbor’s stem than to its own. 
Both of these effects are statistically signifi- 
cant: overproliferation of roots with increas- 
ing competitor root density (F = 11.6013, P < 
0.001) and negative interaction between length 
and competitor root density (F = 58.2655, P < 
0.001). This negative interaction indicates that 
plants responded to competitor root density 
by shifting from local overproliferation at low 
transporting lengths to underproliferation as 
the transporting length between the plant 
shoot and the soil patch increased. 

We thus demonstrated experimentally the 
predictions of our ESS model: Plants do both 
locally overproliferate roots near their stem 
and reduce their foraging range when it over- 
laps with a neighboring root system. By inte- 
grating root allocation and spatial information, 
the exploitative segregation theory reconciles 
hypotheses that have been largely considered 
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contradictory: root segregation and the trag- 
edy of the commons. 
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Directly visualizing the momentum-forbidden dark 
excitons and their dynamics in atomically 


thin semiconductors 


Julien Madéo*, Michael K. L. Man'*, Chakradhar Sahoo", Marshall Campbell?, Vivek Pareek’, 
E. Laine Wong"+, Abdullah Al-Mahboob!, Nicholas S. Chant, Arka Karmakar’, 
Bala Murali Krishna Mariserla’*, Xiaogin Li*, Tony F. Heinz>°, Ting Cao”, Keshav M. Danit} 


Resolving momentum degrees of freedom of excitons, which are electron-hole pairs bound by the 
Coulomb attraction in a photoexcited semiconductor, has remained an elusive goal for decades. In 
atomically thin semiconductors, such a capability could probe the momentum-forbidden dark excitons, 
which critically affect proposed opto-electronic technologies but are not directly accessible using optical 
techniques. Here, we probed the momentum state of excitons in a tungsten diselenide monolayer by 
photoemitting their constituent electrons and resolving them in time, momentum, and energy. We 
obtained a direct visual of the momentum-forbidden dark excitons and studied their properties, including 
their near degeneracy with bright excitons and their formation pathways in the energy-momentum 
landscape. These dark excitons dominated the excited-state distribution, a surprising finding that 
highlights their importance in atomically thin semiconductors. 


he discovery of two-dimensional (2D) 
semiconductors launched exciting op- 
portunities in exploring excited-state 
physics and opto-electronic technolo- 
gies (J-3), driven in part by the exis- 
tence of robust, few-particle excitonic states. 
As a prototypical 2D semiconductor in the 
transition metal dichalcogenide (TMD) family, 
WSe, monolayers exhibit a band structure 
that hosts two degenerate valence band max- 
ima but eight nearly degenerate conduction 
band minima in the hexagonal Brillouin zone 
(BZ) (4, 5). At the K and K’ valleys, the con- 
duction and valence band energies are both 
at local extrema, giving rise to two direct 
bandgap transitions and the bright excitons 
(denoted as K-K excitons) (Fig. 1, A and B). 
These excitons, also called direct excitons in 
the traditional semiconductor literature, are 
behind the strong light absorption and photo- 
luminescence in the WSe, monolayer (6, 7) 
and have been extensively investigated in var- 
ious optical spectroscopy experiments. 
Few experiments, however, have been capa- 
ble of probing the indirect or momentum- 
forbidden dark excitons in monolayer TMDs, 
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which consist of an electron and a hole resid- 
ing at different valleys (8). Such dark excitons 
may interact with bright excitons, serve as the 
preferred carriers of information and energy, 
or form collective states such as exciton liquids 
and condensates (9, 10). Therefore, determin- 
ing the properties and controlling the popula- 
tion of the dark excitons, as well as their 
interactions with the bright excitons, is the 
key to a complete understanding of the un- 
derlying physics and developing future tech- 
nologies. Because of the six other conduction 
band minima at the Q valleys (also denoted as 
A or È in the literature) in WSe, monolayers, 
dark excitons may form with an electron in 
the Q valley and a hole in the K (or K’) valley 
(Fig. 1A) (5, 17). Compared with other TMDs, a 
near degeneracy between the Q- and K-valley 
states of WSe, has been predicted (12), fur- 
thering the possibility that these dark exci- 
tons play an important role in the optical 
properties (13). The crystal momentum mis- 
match between the electrons and holes makes 
them inaccessible in first-order optical processes 
such as absorption and photoluminescence 
(14, 15). 

Momentum-resolved studies of excitons 
have been a long-standing goal (16-23). Such 
studies would provide the resolution to di- 
rectly access the recently sought-after dark 
excitons in monolayer TMDs (14, 24). In gen- 
eral, angle-resolved photoemission spectros- 
copy (ARPES)-based techniques have been 
one of the most successful in providing mo- 
mentum information (25). For example, ARPES 
techniques have successfully probed free car- 
riers in bulk TMDs (26-28) and specially pre- 
pared monolayers (30-32). However, observing 
strongly bound, few-particle excitonic states 
is not straightforward even conceptually, as 
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discussed in a number of recent theoretical 
studies (19-23). Experimentally, serious chal- 
lenges include the need for high-energy ex- 
treme ultraviolet (XUV) photons to access 
states at the BZ vertices (XUV-ARPES), spatial 
resolution to study the typical high-quality 
micrometer-scale TMD samples (u-ARPES) 
(8), and time resolution through a pump-probe 
configuration (TR-ARPES) to access the dy- 
namics of the short-lived excitonic states after 
photoexcitation (33-36). Here, we successfully 
overcame these experimental challenges in a 
single platform to perform TR-XUV-u-ARPES, 
providing a direct visualization of dark exci- 
tons in a WSe, monolayer. We report on dark 
exciton formation pathways under different 
photoexcitation conditions, the nature of their 
spectral degeneracy relative to bright excitons, 
and the dominant role they play in the quasi- 
equilibrium distribution at long time delays. 
Our experiments represent a milestone in 
studies of photoexcited states by providing 
a global view over the entire BZ, along with 
exclusive insights inaccessible otherwise. 

We studied an exfoliated WSe, monolayer 
placed on an hBN buffer layer supported by 
an Si substrate (Fig. 1C). The sample was probed. 
at a temperature of 90 K under ultrahigh 
vacuum conditions (details about sample 
preparation and characterization are provided 
in the supplementary materials). Our experi- 
ments are enabled by a custom-built platform 
that combines an ultrafast, tabletop XUV 
source with a spatially resolving photoemis- 
sion electron microscope (PEEM) (Fig. 1D and 
supplementary text, section 2). First, we per- 
formed an XUV-u-ARPES measurement to 
obtain the band structure of the nonphotoex- 
cited WSe, monolayer (Fig. 2A). For this, we 
used ultrafast XUV probe pulses (21.7 eV) to 
photoemit electrons from the sample. Using 
the high-resolution spatial imaging capabil- 
ities of our PEEM apparatus, we isolated photo- 
electrons emitted from only the monolayer 
region of the sample (see the supplementary 
text, section 3). The reciprocal images of these 
photoelectrons were then dispersed in energy 
in a time-of-flight detector, thereby resolv- 
ing the angle (i.e., momentum) and energy 
(with an energy resolution of 0.03 eV) of each 
photoelectron (37, 38) (Fig. 1D and supple- 
mentary text, sections 2 to 4). The measured 
band structure showed the spin-split valence 
band extrema at the K,K’ valleys and had 
excellent agreement with GW calculations 
(Fig. 2A and materials and methods). The 
energy differences between the K- and Q- 
valley conduction bands were sensitive to 
the lattice parameters and functional used 
in the calculation (see the materials and 
methods). 

Next, to measure the excitonic states of the 
TMD monolayer, we excited the sample with 
an ultrafast pump pulse tunable over the 
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Fig. 1. Time-resolved-XUV-p-ARPES of 
excitons in a WSez monolayer. (A) Left: 
Diagram showing the configuration for 

K-K direct excitons and Q-K indirect excitons 
with holes located in the K valleys and 
electrons in K and Q valleys, respectively. 
Right: Representation of the exciton photo- 
emission process. XUV photons photoemit 
electrons, leaving holes from the bound 
electron-hole pairs. (B) Schematic of the 
k-space structure of monolayer WSe» showing 
the first BZ composed of six Q valleys and 
two K valleys (each K valley is shared between 
three BZs, so only one-third of each falls 
within the first BZ). (©) Optical image of 

the sample composed of a monolayer 

WSe> (blue outline) on hBN (red outline) 

on an n-doped Si substrate. Inset is a 

side view of the sample. (D) Schematic of 
the experimental setup. 


Fig. 2. Excitonic resonances. (A) Experimental 
and theoretical XUV-u-ARPES results from the 
WSes monolayer without optical excitation. 

The dispersion of the occupied quasiparticle 
bands (false color scale of electron emission 
intensity) is shown, together with the calculated 
band structure plotted along T-Q-K-M (dashed 
white lines). (B) Experimental band structure 
with a 1.72-eV photoexcitation resonant with the 
A exciton at a 0.5-ps delay showing below 
conduction band direct and indirect excitonic 
States. The experimental data were compared 
with nonrenormalized conduction bands with 
exciton-induced bandgap renormalization effects 
estimated to be <100 meV (20). (C) Top: 
Photoemission intensity integrated from 1 to 

3 eV above the valence band maximum (VBM) 
versus optical pump energy. Seen clearly is the 
spectrum dominated by resonance at 1.73, 

2.17, and 2.48 eV, corresponding to the A, B, and 
C excitons described in the literature, respec- 
tively. The observed resonance at low energy Is 
attributed to the trion peak (8). Bottom: Spectrum 
of the resonant optical pump (red) and photo- 
emission energy spectrum integrated within 

a +0.015 A? k-space range at the center of the 
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K valley at zero time delay (gray) and also at later time delays (green). (D) Corresponding momentum-resolved photoemission intensity integrated from 1 to 
3 eV above the VBM showing the exciton signals in the K valley for a resonant excitation with the A exciton at zero time delay. The observed asymmetry in the 
intensity distribution is largely caused by the matrix element effect determined by the polarization and incidence angle of the photoemission probe. 


visible and near-infrared range of the spec- 
trum. Then, the ultrafast XUV probe pulse 
discussed above was introduced at a variable 
time delay to measure the time-, angle-, and 
energy-resolved photoelectron spectrum from 
the WSe, monolayer (TR-XUV-u-ARPES). Re- 
cent theoretical studies have predicted photo- 
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emission signals from excitons exhibiting 
an energy-momentum distribution centered 
in the corresponding conduction band valley 
but binding energy below the conduction 
band minimum. In our measurements, after 
resonant photoexcitation of the A exciton, a 
substantial photoemission signal at positive 
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time delay was seen below the bandgap cen- 
tered at the K (K’), located at 1.26 A, and Q 
valleys, located at +0.75 A’ (Fig. 2B) (21). To 
ensure that the detected signals corresponded 
to excitonic states, we measured the photo- 
emission excitation spectrum (Fig. 2C, top 
panel), i.e., the integrated photoemission 
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Fig. 3. Exciton dynamics after resonant exci- 
tation. (A) TR-ARPES data for delay times of 

O, 0.3, and 1.5 ps for the full 2D BZ (in-plane) and 
emission energy (vertical). The top of the valence 
bands is displayed in grayscale; emission from 
the K-valley (Q-valley) excitons is plotted in blue 
(red) dots (see the supplementary materials). 
The black dashed line defines the boundary of 
the first BZ. (B) TR-ARPES data for delay times 
of O, 0.3, and 1.5 ps along the line connecting the 
K and Q valleys. The dotted lines show the 
calculated (spin-split) conduction bands. 

(C) Exciton density versus time delay at Q valleys 
(red) and K valleys (blue). The exciton density 
was determined by the ARPES signal integrated 
over the two K valleys or six Q valleys of the 
first BZ and an energy range from 1 to 2.5 eV, 
with a correction factor for the respective 
photoemission matrix elements. The dotted black 
line shows the instrument response function, 
i.e., the convolution of the pump and probe pulse. 
The corresponding Gaussian pulse is plotted in 
green. Inset is the ratio of the K/Q population 
(black) obtained by dividing the data in Fig. 3C 
showing dominant K-valley exciton population 
initially but a ratio approaching ~0.5 at longer 
time delays. The blue and red curves correspond 
to the center energy of a Gaussian fit of the 
distributions of the K- and Q-valley signals, 
respectively. (D) Time-resolved spectrum of the 
K-valley signal integrated over the first BZ. (E) 
Time-resolved spectrum of the Q-valley signal 
integrated over the first BZ. K-valley excitons are 
populated directly upon photoexcitation. We 
observed a clear delay in the rise of the Q-valley 
population caused by K to Q scattering. 


intensity (from 1 to 2.5 eV above VBI in the 
first BZ), versus the optical excitation energy 
(varied from 1.58 to 2.85 eV). In the photo- 
emission excitation spectrum, we clearly ob- 
served the distinctive A-, B-, and C-exciton 
resonances as previously reported in optical 
absorption (6). This energy dependence con- 
firms the dominance of excitons under the 
experimental conditions used in this work, 
namely photoexcitation density, sample struc- 
ture, and photoemission probe. Then, we 
tuned our pump pulse to 1.72 eV to match 
the A-exciton resonance as shown in the bot- 
tom panel of Fig. 2C. This choice of pump 
wavelength ensured that we were predo- 
minantly and resonantly exciting excitons. 
Finally, looking at the energy- and momentum- 
resolved photoemission signal under these 
pump conditions, we clearly saw a signal in 
the K valley and also at exactly the energy of 
the A exciton. This confirms that the photo- 
emission signal at ~1.73 eV, located at +1.26 A” 
along the K-T cut (.e., in the K-valley), cor- 
responds to excitons in the K valley, in agree- 
ment with previous theoretical calculations 
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(39) and optical experiments (40). By exten- 
sion, given the expected near degeneracy, we 
attribute the signal at 1.73 eV and +0.75 A” 
momentum along the K-T cut to excitons in 
the Q valley. We assigned these two signals 
in part to the bright K-K and momentum- 
forbidden dark Q-K excitons, respectively (see 
the supplementary text, section 7). Given the 
large inhomogeneous broadening (~250 meV) 
in the photoemission spectrum of our sample 
(see the supplementary text, section 5), we 
expect that the K-valley signal also constitutes 
other excitonic states that are nearly resonant 
with the bright K-K exciton, such as the spin- 
dark K-K exciton, the indirect K-K’ exciton, as 
well as trions and biexcitons. Similar consid- 
erations also apply for the Q-valley signal. In 
the following, we will refer to the ensemble of 
excitonic states that constitutes the K- and 
Q-valley signals as K- and Q-valley excitons. 
We note that the calculated single-particle 
band structure in this work does not include 
renormalization effects caused by the exciton 
density, but we estimate this renormalization 
to be <100 meV (20). We also observed the K- 
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and Q-valley excitons with electrons at the 
K,K' and Q valleys and the presence of holes 
that could be seen by the depletion of elec- 
trons in the K,K’ valley by taking the differ- 
ence between the band structures without 
and after photoexcitation (fig. S14A). The 
photoemission spectrum taken at different 
time delays (fig. S14B) after photoexcitation 
then allowed us to follow the formation dy- 
namics of these dark indirect excitons and 
learn other aspects of their nature. To elim- 
inate rigid energy shifts or offsets of the 
entire band structure caused by surface pho- 
tovoltage effect or other similar phenomena, 
we set the peak of the upper spin-split va- 
lence band as the zero-energy reference for 
every time delay (see the supplementary text, 
section 8). We also used an optical excita- 
tion spot much larger than the sample to 
eliminate any lateral contribution to surface 
photovoltage caused by local variations of 
intensity (41). 

To study the formation dynamics, we next 
resonantly excited the optically allowed K-K 
exciton as seen in Fig. 2C. We used linearly 
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Fig. 4. Exciton formation and dynamics 
after above-gap excitation. (A) 3D plots of 
the experimental data at 0.3-, 0.5-, and 1.5-ps 
time delays. The top of the valence bands is 
displayed in grayscale. The K-valley (Q-valley) 
excitons are plotted in blue (red) dots. The 
black dashed line defines the boundary 

of the first BZ. (B) ARPES data at the K and 
Q valleys for 0.3-, 0.5-, and 1.5-ps time delays. 
Dotted line is the theoretical calculation 

of the conduction bands. (C) Top: K-valley 
(blue) and Q-valley (red) exciton density over 
the first BZ from 1 to 2 eV. Bottom: Center 
energy of the photoexcited population 
obtained from fits with Gaussian function 

at the center of the K and Q valley. Inset is 
the ratio of the total populations between 
the K and Q valleys extracted from the top 
panel. (D) Time-resolved spectra at the K 
valley (top) and the Q valley (bottom) showing 
the formation and relaxation of excitons. 


polarized pump pulses at 1.72 eV with a fluence 
chosen to produce an estimated exciton den- 
sity of 2.4 x 10'*/cm? (see the materials and 
methods and supplementary text, section 16). 
At zero-time delay, we observed only the K- 
valley excitons (Fig. 3, A and B). As expected 
for resonant excitation, the formation of K- 
valley excitons was rapid, as seen in Fig. 3C. 
The energy of the K-valley exciton (1.73 + 
0.03 eV) was consistent with the energy of 
the pump excitation, within experimental 
uncertainty, and did not change for longer 
delay times (Fig. 3C). At later times, we saw 
a clear buildup of the dark Q-valley exciton 
population at energies nearly degenerate (with- 
in our 0.03-eV energy resolution) with the 
K-valley excitons (Fig. 3D). Through the mo- 
mentum sensitivity of the measurement, we 
directly observed the formation of the dark 
Q-valley excitons through scattering from the 
K-valley excitons on an ~400-fs time scale (see 
the supplementary text, section 13). Theoretical 
studies and indirect optical measurements 
have reported on phonon-assisted interval- 
ley scattering of excitons (23, 42-44) that are 
fundamentally different from the electron- 
phonon interactions seen in bulk TMDs (28). 
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Our measurements directly accessed this 
exciton-phonon scattering and are consistent 
with the reported time scales of few hundred 
femtoseconds. Given the presence of resident 
carriers from unintentional doping, exciton- 
plasmon scattering could also play a role. 
Future experiments varying the doping con- 
centration would allow one to tease out this 
contribution. The Q and K signals also showed. 
similar recovery times, which is consistent 
with previous studies showing that the dark 
Q-K exciton acts as a long-lived reservoir for 
the K-K exciton (45). The extracted decay 
times were 1.7 and 2.5 ps for K- and Q-valley 
excitons, respectively (see the supplementary 
text, section 14), reflecting an average decay 
time corresponding to all the different exci- 
tonic states probed. Future experiments using 
samples with narrower linewidths in the 
ARPES signal could allow one to distinguish 
between the different excitonic states and 
the corresponding dynamics between them. 
Also of importance is the evolution to a quasi- 
equilibrium distribution of excitons. We ini- 
tially created a larger K-valley population by 
resonant excitation with the A exciton, but 
within a picosecond, the Q-valley exciton 
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population dominated, with the K-valley/ 
Q-valley ratio tending toward ~0.5 (Fig. 3C, 
inset). We note that in calculating the ratio, 
we included the population in the entire 
first BZ comprising six Q valleys and two 
K valleys, as well as a normalization factor 
arising from the different photoemission 
matrix elements (between band states and 
photoelectron states) at the K and Q valleys 
(see the supplementary text, section 10). As- 
suming this limiting ratio reflects equilib- 
rium at the lattice temperature of 90 K and 
assuming a density of state factor on the 
order of 1 (for details, see the supplementary 
materials), one can obtain a tighter bound 
of <0.015 eV on the energy difference of the 
two exciton species. A video of the exciton 
dynamics measured over the full BZ after 
resonant photoexcitation is provided in the 
supplementary materials (movie S1). 

Finally, we turned our attention to the dy- 
namics after above-gap excitation. We used a 
2.48-eV linearly polarized pump pulse to ex- 
cite carriers well above bandgap, producing 
an estimated carrier density of 1.6 x 10’°/cm? 
(see the materials and methods and supple- 
mentary text, section 16). Unexpectedly, the 
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exciton relaxation pathways, particularly for 
the dark Q-valley excitons, were substantially 
different from those observed for resonant 
excitation. Figure 4A shows snapshots of the 
full ARPES data at different time delays at 
the K and Q valleys (movie S2). Immediately 
after excitation (zero delay), we observed a 
broad distribution in the K and Q valleys 
centered at an energy of ~1.90 eV (fig. S11) 
that could involve contributions from both 
free carriers and excited excitonic states. Fu- 
ture experiments with improved time and 
energy resolution are needed to explore these 
very early dynamics (see the supplementary 
text, section 11). Beyond this regime, we ob- 
served full relaxation into the K- or Q-valley 
excitons within 500 fs (Fig. 4B). Previously, 
optical and mid-infrared spectroscopic mea- 
surements had reported the subpicosecond 
formation of exciton dynamics (46, 47) but 
lacked separate access to the dynamics of the 
different types of excitonic states, such as the 
dark Q-valley excitons. The relaxation process 
can also be described by plotting the average 
energy of the photoemission signal versus 
time (Fig. 4C), giving an energy relaxation 
time of 500 fs. Beyond 500 fs, the peak energy 
of the distribution at the Q and K valleys 
remained constant at ~1.73 eV, matching the 
exciton energies under resonant conditions 
(Figs. 3C, inset, and 4C). A marked departure 
from the resonant excitation case is that Q- 
valley excitons appeared coincident with the 
K-valley excitons (rather than at a finite delay 
after scattering of the K-valley excitons as for 
the resonant excitation). We also observed 
that the dark Q-valley exciton density domi- 
nated the direct K-valley density at all time 
delays (Fig. 4C, and see inset), in contrast to 
the resonant excitation case. However, at long 
time delays, under both resonant and above- 
gap excitation, the system evolved to a similar 
quasi-steady state, with a nearly identical K- 
valley/Q-valley exciton population ratio and 
nearly degenerate exciton energies with re- 
spect to the valence band maximum. The 
exciton-binding energy is given by the dif- 
ference between the conduction band mini- 
mum and the energy of the constituent electron 
photoemitted from the center of the valley at 
long time delays, where quasi-equilibrium is 
reached (44). Accordingly, we estimated bind- 
ing energies as ~390 and ~480 meV (with 
respect to the conduction band minima) for 
the K-valley and Q-valley excitons, respectively, 
with a combined uncertainty of ~150 meV 
caused by theoretical errors, bandgap renor- 
malization effects, and experimental uncer- 
tainty. Although the former can be compared 
to results of various optical spectroscopy 
measurements (3), the binding energy and 
momentum-space distributions of the dark 
exciton in monolayer TMDs are not easily 
accessible to other experiments (23). 
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Our measurements using TR-XUV-u-ARPES 
to access strongly bound, few-particle exci- 
tonic states in 2D semiconductors and their 
dynamics open new possibilities. Such direct 
access to dark excitons or other valley- and 
spin-polarized excitons will enable their utility 
in quantum information (48) and valleytronic 
and spintronic schemes and in creating new 
many-body excitonic states (9, 10). Energy- and 
momentum-resolved photoemission studies of 
excitons could directly image excitonic wave- 
functions in momentum space. On the basis 
of the energy-momentum dispersion rela- 
tionship, one could measure important phys- 
ical properties, such as the kinetic energy and 
temperature of photoexcited excitons (20-23). 
Future measurements could access few-particle 
excitations, such as trions, biexcitons, and 
interlayer excitons in TMD heterostructures, 
which may be expected to have their own 
specific photoemission signatures. Also, the 
use of circularly polarized excitation would 
provide direct access to the nature of K to K’ 
exciton scattering, critical information in the 
context of valleytronics. Finally, we expect our 
measurements to extend to other condensed- 
matter systems in providing a complete pic- 
ture of the transformation of their electronic 
structure along energy and momentum axes 
after optical illumination. 
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Super-resolution lightwave tomography of electronic 
bands in quantum materials 


M. Borsch?*, C. P. Schmid?*, L. Weigl?, S. Schlauderer*, N. Hofmann“, C. Lange*t, J. T. Steiner’, 


S. W. Koch?, R. Huber*t, M. Kira’t 


Searching for quantum functionalities requires access to the electronic structure, constituting the 
foundation of exquisite spin-valley—electronic, topological, and many-body effects. All-optical band- 
structure reconstruction could directly connect electronic structure with the coveted quantum 
phenomena if strong lightwaves transported localized electrons within preselected bands. Here, we 
demonstrate that harmonic sideband (HSB) generation in monolayer tungsten diselenide creates distinct 
electronic interference combs in momentum space. Locating these momentum combs in spectroscopy 
enables super-resolution tomography of key band-structure details in situ. We experimentally tuned 
the optical-driver frequency by a full octave and show that the predicted super-resolution manifests in a 
critical intensity and frequency dependence of HSBs. Our concept offers a practical, all-optical, 

fully three-dimensional tomography of electronic structure even in microscopically small quantum 


materials, band by band. 


and-structure engineering (J-3) strives 

to design the dependence of energies 

and geometric phases (4) of electronic 
states of solids on the wave vector, K. 
These attributes ultimately control emer- 

gent quantum phenomena, ranging from ex- 
citons (Coulomb-bound electron-hole pairs) 
(5, 6) to many-body complexes (7) and quantum 
phase transitions (8). Monolayers of transition- 
metal dichalcogenides, such as tungsten dis- 
elenide (WSe,), host a bounty of such quantum 
effects (1, 6, 8, 9). Angle-resolved photoelec- 
tron spectroscopy has championed electronic 
band-structure mapping (JO-13) in samples 
of sufficient size, measured at ultrahigh vacu- 
um conditions. All-optical techniques, such 
as harmonic sideband (HSB) (4-17) and high- 
harmonic generation (HHG) (J8-27), promise 
in situ probing of even microscopically small 
and/or atomically thin solids in ambient. Yet, 
HHG mapping encounters challenges: Atom- 
ically strong lightwaves inherently spread elec- 
trons broadly throughout the Brillouin zone 
(BZ) (20) and induce electronic interferences 
(21) as well as transport among multiple bands. 
Such contributions become disentangled in 
HSB generation, in which an optical field se- 
lects electronic bands by resonantly exciting co- 
herences between a specific valence-conduction 
band pair, creating an electron-hole (e-/) ex- 
citation (Fig. 1A, top red lines). If tuned in 
resonance with the 1s A exciton of monolayer 
WSe,, the pulse selectively prepares an exci- 


‘Department of Electrical Engineering and Computer Science, 
University of Michigan, Ann Arbor, MI, USA. Department of 
Physics, University of Regensburg, Regensburg, Germany. 
3Department of Physics, University of Marburg, Marburg, Germany. 
*These authors contributed equally to this work. 

tPresent address: Fakultat Physik, Technische Universitat Dortmund, 
Otto-Hahn-StraBe 4, 44227 Dortmund, Germany. 

Corresponding author. Email: mackkira@umich.edu (M.K.); 
rupert.huber@ur.de (R.H.) 


Borsch et al., Science 3'70, 1204-1207 (2020) 


tonic wave packet defined by the microscopic 
interband polarization P;, (Fig. 1A, color map 
at time t = O) close to the K point where the 
energy difference E¢" (Fig. 1A, bottom red line) 
between the two bands is minimal. Unlike in 
HHG, Px is strongly localized and oscillates 
with the frequency vis corresponding to the 
1s-exciton energy, Avis (where h is Planck’s 
constant). A strong multi-terahertz field at a 
frequency vry, < Eg" /hthen creates HSBs by 
transporting Px from the K point to k (Fig. 1A, 
color map at t > 0). The resulting increase, 
Ee" — Eẹ” p in the energy of Px is emitted as 


A 
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HSB radiation, connecting the HSB spectrum 
with the band structure. Considerable spreading 
of P,,—although much weaker than in HHG— 
and many-body contributions to the emission 
energy may complicate a direct inference of 
E” from HSB spectra. Nevertheless, the HSB 
emission can be elegantly linked with E¢" be- 
cause the polarization wave packet Px starts 
from the K point and is almost sinusoidally 
translated by the multiterahertz field (28). 
The connection of the HSB spectrum with this 
well-synchronized motion and Ef" becomes 
apparent when Pk(t) = PEt (t)e ~? ovt is an- 
alyzed in a frame rotating with the optical 
excitation frequency Vopt- The nth HSB is gen- 
erated if PO'(t) oscillates with the frequency 
Vn = N Vrug Which occurs only if P(t) has 
been translated to a K value where the HSB- 
photon energy, EHB = hvopt + Avn, matches 
the energy of P—that is, EĦ®S = Ef". As illus- 
trated in Fig. 1B for the 5th HSB, the oscillat- 
ing polarization, Pi (t) crosses a given k value 
twice during a single multiterahertz cycle. In 
a simplified picture, the multiterahertz field 
purely translates PK’ (t) = Py, with k(t) = 
k — 2r|e|/h |! „dt' Erpz(t') so that these two 
crossings (sparks) can be viewed as separate 
polarization bursts, adding up as a coherent 
superposition to the total spectral component 


PI (vs) — fdt Pe (1) 
The phase evolution between these bursts is 


defined by the spectrum-projecting kernel, 
e+ ?tvsl which oscillates five times faster than 


Wave vector, k, 
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Fig. 1. Harmonic sideband generation and crystal momentum combs. (A) Band structure (top) of 
monolayer WSez and typical exciton wave-packet dynamics (bottom) (color map |P; (t)| of many-body computation) 
in HSB. The red lines indicate the e-h bands, and the thick black line indicates the driving multiterahertz field 
(cycle duration, Try, = viz), projected onto the ai surface (gray). (B) Schematics of a |P(°'(t)| (Gaussians) 
on a sinusoidal path (transparent area) crossing a given k point twice during a multiterahertz cycle. These 
crossings (sparks) yield HSB-emission contributions interfering as waves (blue shaded areas). The screen 
compares k combs (blue-white color maps) with their even parts (red-yellow color maps) for Oth to 5th HSBs on 


the basis of full many-body computations. 
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the K(¢) trajectory (Fig. 1B, light blue area). 
Hence, there are, respectively, six or five k val- 
ues where the first and second bursts are in 
(Fig. 1B, solid line) or out of phase (Fig. 1B, 
dashed line), leading to interference max- 
ima or minima. Generally, Pj°'(v,) exhibits a 
k-space interference pattern with (nm + 1) 
maxima (screen), which we call a K comb in 
analogy to frequency combs (29, 30) created 
by multiple time-domain pulses. Because k 
combs are the projections of P(t) (having 
the energy E”) onto a given Av», their ampli- 
tude decreases with increasing HSB-to-band 
energy difference. 

In reality, Pj°'(t) cannot be reduced to a 
simple k(t) translation because many-body 
interactions distort the path, dephase the po- 
larization, and renormalize the emission en- 
ergies. We performed full quantum-kinetic 
computations with a state-of-the-art dynamic 
cluster-expansion approach (supplementary 
materials) (5) to systematically predict the 
HSB properties on the basis of fundamental 
electronic band structure, geometric phase 
effects, and the strong light-matter as well as 
the many-body Coulomb interactions. Although 
many-body and interference effects spread 
Px and create new oscillating features (Fig. 1A, 
blue regions), the (n + 1)-peak structure of 
k combs remains robust (Fig. 1B, blue color 
maps). At the K point, all k combs of odd 
order exhibit a node, whereas those of even 
order feature a peak, implying a dominantly 
odd and even symmetry, respectively. This 
symmetry can be broken only by transport- 
ing Pi” (t) to inversion asymmetric regions of 
the bands where Eg" z E%.. 

Because Px defines the coherent transition 
amplitude for e-h recombination (5) associated 
with the dipole moment dx, its symmetry af- 
fects the total HSB intensity 


Tuse (v) 0 |(vop +V) Y pe PEO] O) 


where the k sum includes the entire BZ. For 
symmetric systems (E = E®h, and dk = dx), 
odd-order sidebands vanish because the odd- 
order k combs eliminate the total sum in Jysp(v). 
Monolayer WSe, has inversion symmetric bands 
close to the K point, with asymmetric features 
(Efe = E% ) emerging toward the M point. 
Only the even part of each comb, Py" (vn) = 
5 Pio’ (vn) + P*%(Vn)], contributes to Lysp(vn), 
assuming that dp is symmetric. Py" (vn) (Fig. 
1B, red color map) for n = 0, 2, and 4 closely 
follows the respective k comb (Fig. 1B, blue color 
map). In stark contrast, odd orders produce 
a PẸ (vn) that vanishes over the inversion- 
symmetric region of the bands and peaks 
sharply at the onset of the band asymmetry. 
Thus, odd-order sidebands solely originate 
from the outermost comb line. Although the 
Py" (vn) of the even HSBs are not localized, 
their k combs have a sharp leading edge whose 
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energy and momentum overlap with the band 
structure determines the strength of the re- 
spective HSB emission. In any case, the posi- 
tion of the outermost comb line locates the 
origin of HSB at 


k(n) 


which can be interpreted as the maximum 
excursion of a classical trajectory times a many- 
body quantum correction C,,. The latter typ- 
ically assumes a value between 0.5 and 1.5. 
The super-resolution follows as the width of 
the outermost comb line locates the nth HSB 
emission to an (n + 1) times narrower K range 
than the excursion scale of the wave packets 
(supplementary materials), which improves 
the K resolution linearly with n. The vectorial 
position of the k comb is assigned by k(n) 
times the direction of the multiterahertz field. 

To show that super-resolution prominently 
manifests in experimental HSB spectra, we 
resonantly excited the 1s A exciton resonance 
in monolayer WSe, with an optical pulse (du- 
ration, 100 fs) centered at the photon energy 
Of Avopt = 1.665 eV. A linearly polarized multi- 
terahertz transient centered at vraz = 42 THz 
accelerates Pk (supplementary materials) along 
the T -K direction. We controlled k(n) by tuning 
the peak electric field ER** from 7 to 19 MV 
cm™ (in vacuum). The shape of the HSB spec- 
tra (Fig. 2A) changes strongly as function of 
Eve’ and covers the entire visible spectrum. 
The third-order sideband emerges only for 
the highest fields, whereas the 5th order sets 
in steeply at El. — 7 MV cm“, reaches its 
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maximum at 13 MV cm”, and decreases for 
yet stronger fields (Fig. 2C). The 4th order 
exhibits a nonmonotonic scaling, too (sup- 
plementary materials), albeit not as steep as 
that of the 5th. 

To connect this characteristic scaling behav- 
ior with the k combs, we compared the experi- 
ment with our full many-body computations. 
All HSB spectra (Fig. 2B) and the intensity 
scaling agree quantitatively, as shown for the 
5th-order sideband in Fig. 2C. This allows us 
to faithfully assign a specific crystal momen- 
tum (5) (Fig. 2C, bottom scale) to each Le 
value (Fig. 2C, top scale) using Eq. 3. The 
computed |P¢"°"(vs)|" combs (Fig. 2D, color 
map) and k(5) (Fig. 2D, blue lines) are shown 
in Fig. 2D as a function of ERs. In Fig. 2E, 
the corresponding dispersion relation Ee 
(dark area) is superimposed with ES (dotted 
line) to easily identify the band asymmetry 
(E « E*" ), The onset of the 5th HSB inten- 
sity (Fig. 2C) occurs precisely when (5) passes 
the band asymmetry (Fig. 2, C to E, arrow), ver- 
ifying the super-resolution connection of odd- 
order HSB intensity and the position of the 
outermost comb line. Shifting the K comb 
further by larger ERs inevitably increases 
the central energy of the polarization com- 
pared with that of the 5th HSB (Fig. 2E, blue 
line), reducing the energetic overlap, and thus 
decreases the 5th HSB intensity. Such a sharp 
Ek dependent maximum in HSB intensity 
signifies a distinctive super-resolution signature 
for HSB spectroscopy. We verified similar but 
slightly less stringent super-resolution for 
even HSBs (supplementary materials). 
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Fig. 2. Super-resolution band-structure tomography with a field-strength scan. (A) Measured versus 
(B) computed HSB intensity spectra for different Ey, (shifted for visibility) at fixed vty, = 42 THz. The Sth 
HSB intensity, lysg, includes the spectral range of the yellow transparent regions. (C) Et}: dependence 
(top axis) of measured (circles and error bars) and computed (shaded area) 5th Isp. The corresponding 
k(5) is assigned in the bottom axis by using Eq. 3. (D) The computed Er}; dependence of the symmetric 
part of |Py’°"(vs)| and the comb position (blue line). (E) Electron-hole dispersion (gray area) in units of 
the K-to-K' valley separation |K' — K| and Bo. (dotted line) are connected with the HSB and k comb through 


the blue arrow. 
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To tomographically scan the 4(n)-dependent 
crossing of the k comb with Ef", we have 
developed a setup to quantitatively compare 
HSB spectra while varying vy, over a full 
optical octave, from 27 to 54 THz. The field 
strength is fixed at EP“ — 14 MV cm” to en- 
sure efficient sideband generation (Fig. 3A) 
while suppressing parasitic interband excita- 
tions by the multiterahertz field. The 5th HSB 
is essentially absent for vy, = 27 THz, sets on 
sharply for larger THz frequencies (Fig. 3C), 
and peaks at around 40 THz. The computed 
spectra (Fig. 3B) and their scaling behavior 
as function of v7, (Fig. 3B) reproduce all these 
signatures very well. The vy, dependence of 
|Pe"" (vs)|? (Fig. 3D, color map) confirms that 
the HSB peak coincides with the k-comb peak 
(Fig. 3D, blue arrow) for vy, = 42 THz, where 
the single comb line (Fig. 3D, blue line) both 
overlaps with the asymmetric region ÆE® « E%.) 
(Fig. 3D, solid versus dashed lines) and has an 
energy close to the polarization (=>), For other 
Vriz Values, the HSB intensity becomes weaker 
because the k comb either does not reach the 
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Fig. 3. Super-resolution band-structure tomogra- 
phy with a frequency scan. (A) Measured versus 
(B) computed HSB intensity spectra for different 
multiterahertz frequencies vz, (shifted for visibility), 
at fixed Erz = 14 MV/cm. The yellow regions indicate 
the spectral range of the 5th /ysp. (©) Measured 
(circles and error bars) and computed (shaded) 
5th-order lysg as function of vtz (scale left) and the 
corresponding HSB energy (scale right). (D) The 
5th HSB k-comb position (blue line), k(5) from 

Eq. 3 in units of the K-to-K’ valley separation 

|K' — K|, and symmetric part of |Py’*" (vs)? (color 
map) as function of vty; (left) and band energy 
(right) are compared with the electron-hole energy 
(solid line) and ES (dashed line). The blue arrow 
assigns the peak HSB-emission conditions. 
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asymmetric range (high vyy,) or differs sub- 
stantially from the band energy (low vrq,), 
creating a narrow vry; dependent peak in the 
5th HSB. 

These combined experimental and theoret- 
ical results herald super-resolution lightwave 
tomography: Because the k combs localize the 
HSB emission to a single (k, energy)-point 
whose connection to the experimental param- 
eters (Etuz,; Vruz) is Known, scanning (Eruz 
vraz) tomographically maps Eê}. Shown in 
Fig. 4 is the computed HSB intensity, [isp(v,,), 
resulting from a comprehensive (Eruz Vraz) 
scan, each point assigned to a single [k (n), £H] 
pair by means of Eq. 3, for n = 3, 4, 5, 6 (Fig. 4, 
A to D) together with the Eo (solid lines) and 
E% (dashed lines) bands. The edges of Fysp(vVn) 
track the entire band structure in remarkable 
detail, confirming that the overlap of the k 
combs with the band structure defines the 
strength of the HSB emission. As a stress test 
for our tomography, we measured the same 
Iasg vn) With one-dimensional Eyy, and vruz 
scans across few band-structure features and 
calibrated the experiment-theory intensities 
with a single factor that does not affect the 
assessed (Eruz; Vrz) dependence. The exper- 
imental HSB scans (Fig. 4, A to D, within 
black bars) and the computations match nearly 
perfectly. 

Also, the microscopic dipole, ad, affects the 
HSB intensity Eq. 3, which may distort the to- 
mography by overemphasizing momentum re- 
gimes with large dx. However, super-resolution 
of a k comb locates the odd-order HSBs to a 
single pair (k, energy), whereas dx changes 


lusp (norm.) 
1 0.1 0.01 


Sideband energy, nN hvyyz (eV) 


K M K 
Wave vector, k 





slowly over the k comb. Consequently, taking 
ratios of two HSB intensities, such as Rstnsra = 
Tusp(vs)/Fusp(v3), eliminates distortions by dx. 
In Fig. 4E, blue color maps show the first posi- 
tive peak of gradient & Rsh /3ra, aNd red color 
maps show the first negative peak of gradient 
Feo Roth /sra, Starting from the K point (2 eV) 
to locate Eg" (Fig. 4E, solid line). The resulting 
Rstn/sra gradients track the band dispersion 
close to the band asymmetry, as expected for 
the odd K combs (HSBs), more accurately 
than the raw HSB intensities (Fig. 4, A and ©). 
Nevertheless, dg can be measured most sen- 
sitively through the rich details of HSB spectra 
(supplementary materials). The same analysis 
is repeated for the ratio of 6th and 4th HSBs 
in Fig. 4F for & Ren/4tn (blue color maps), 
Feeney Roth Jan (red color maps), and E$" (black 
line). The peaks of these gradients track the 
band dispersion, although less sharply than 
for the odd HSBs because the latter benefits 
from super-resolution, as discussed above. The 
gradient approach also allows us to unfold 
the left- and right-hand side of the bands with 
respect to the K point. We have furthermore 
verified that using C, = 1.3 in Eq. 3 yields only 
minor tomographic aberrations, making pure- 
ly experimental super-resolution tomography 
possible, whereas fixing these aberrations yields 
a nontrivial value of C,, and thus experimen- 
tal access to intriguing many-body effects in 
quantum materials (supplementary materials). 

We introduce a powerful all-optical band- 
structure tomography by merging three comple- 
mentary advantages of the harmonic sideband 
generation: An optical field selects a pair of 


Gradient (norm. ) 
1 0 -1 





Wave vector, k 


Fig. 4. Super-resolution lightwave tomography. (A to D) Color maps show the computed (A) 3rd, (B) 
4th, (C) 5th, and (D) 6th HSB intensity constructed from a comprehensive (Ethz, VtHz) scan and compared 
with one-dimensional experimental (Ety, and vryz) scans (black boxes). Each (Etyz, vtyz) iS connected 
with the wave vector by using Eq. 3 and the nth HSB energy A(vopt + NvtHz). Also, the electron-hole dispersion 
(black line) and eal (dotted line) are shown. (E and F) Computed gradients of 5th:3rd (E) and 6th:4th 

(F) HSB taken in k (blue color map) and energy (red color map) directions, compared with the E (black 
line); only the first encountered gradient peaks are shown. 
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valence and conduction bands at a time, a 
strong multiterahertz field introduces crystal- 
momentum combs (kK combs) that localize the 
origin of the emission to a small region within 
these bands, and the resulting excitations di- 
rectly involve the quasiparticles relevant for 
quantum effects. The k combs are demon- 
strated to deliver a super-resolution lightwave 
tomography of quantum materials by using 
one- and two-dimensional multiterahertz 
frequency-field strength scans. Our approach 
can be straightforwardly generalized for a fully 
three-dimensional band tomography by add- 
ing the direction of the multiterahertz field as 
the third scanned variable. Also, geometric 
phase effects can be characterized by measur- 
ing the polarization of the sideband emission 
with respect to optical excitation. The intro- 
duced methodology seems optimally suited 
for high-precision characterization and control 
of quantum materials such as two-dimensional 
monolayer heterostructures, even on micro- 
scopic scales. 
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De novo design of potent and resilient hACE2 decoys 


to neutralize SARS-CoV-2 


Thomas W. Linsky'*, Renan Vergara’, Nuria Codina’*, Jorgen W. Nelson’, Matthew J. Walker’, 
Wen Su*, Christopher 0. Barnes®, Tien-Ying Hsiang“, Katharina Esser-Nobis“, Kevin Yu’, 

Z. Beau Reneer”, Yixuan J. Hou*, Tanu Priya’, Masaya Mitsumoto!, Avery Pong’, Uland Y. Lau’, 
Marsha L. Mason’, Jerry Chen’, Alex Chen’, Tania Berrocal’, Hong Peng’, Nicole S. Clairmont’, 
Javier Castellanos’, Yu-Ru Lin’, Anna Josephson-Day’, Ralph S. Baric®, Deborah H. Fuller’, 
Carl D. Walkey’, Ted M. Ross®®, Ryan Swanson’, Pamela J. Bjorkman®, Michael Gale Jr.*, 

Luis M. Blancas-Mejia’, Hui-Ling Yen’, Daniel-Adriano Silva’ + 


We developed a de novo protein design strategy to swiftly engineer decoys for neutralizing pathogens 
that exploit extracellular host proteins to infect the cell. Our pipeline allowed the design, validation, and 
optimization of de novo human angiotensin-converting enzyme 2 (hACE2) decoys to neutralize severe 
acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The best monovalent decoy, CTC-445.2, bound 
with low nanomolar affinity and high specificity to the receptor-binding domain (RBD) of the spike 
protein. Cryo—-electron microscopy (cryo-EM) showed that the design is accurate and can simultaneously 
bind to all three RBDs of a single spike protein. Because the decoy replicates the spike protein target 
interface in hACE2, it is intrinsically resilient to viral mutational escape. A bivalent decoy, CTC-445.2d, 
showed ~10-fold improvement in binding. CTC-445.2d potently neutralized SARS-CoV-2 infection of 
cells in vitro, and a single intranasal prophylactic dose of decoy protected Syrian hamsters from a 


subsequent lethal SARS-CoV-2 challenge. 


ince its emergence as a global pandemic 

in December of 2019, severe acute respi- 

ratory syndrome coronavirus 2 (SARS- 

CoV-2) has caused millions of COVID-19 

cases. The need for effective strategies 
to prevent and treat the disease remains ur- 
gent (1). There are multiple ongoing efforts to 
develop prophylactics and therapeutics using 
various approaches (2) such as vaccination (3), 
traditional protein engineering (1, 4, 5), de 
novo protein design (6), and small-molecule 
drug discovery (7). A challenge is that the 
high mutational rate of positive sense single- 
strand RNA (+ssRNA) viruses (8-10) can often 
lead to viral escape (D, which could compro- 
mise the efficacy of many SARS-CoV-2 ther- 
apeutics under development. Several mutations 
have already occurred in the S protein of SARS- 
CoV-2 in the infected population (12, 13). Deep- 
sequencing studies of the receptor-binding 
domain (RBD) have shown that simple muta- 
tions can enable the virus to escape known 
netralizing antibodies or to increase its bind- 
ing affinity for human angiotensin-converting 
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enzyme 2 (hACE2) (14, 15), the membrane pro- 
tein that the virus exploits to gain entry into 
the cell. There is thus a pressing need to de- 
velop new therapeutics that can be more re- 
sistant to SARS-CoV-2 mutational escape. 
Traditional approaches to combatting vi- 
ruses (e.g., vaccination and monoclonal anti- 
bodies) rely on molecules interacting with 
the pathogens in a way that is fundamentally 
different from how the pathogen engages with 
its cellular targets (16, 17). Viruses can be se- 
lected to evade neutralization, undergoing pro- 
tein mutations that prevent recognition by 
the neutralizing molecules (e.g., antibodies) 
while preserving viral fitness. To address these 
challenges, we have developed a computational 
protein design strategy that enables the rapid 
and accurate design of hyperstable de novo 
protein “decoys” that replicate the protein re- 
ceptor interface to which a virus binds to infect 
a cell. The decoys can achieve a similar or even 
higher affinity than the original protein recep- 
tor by stabilizing the binding interface. There- 
fore, at an optimal concentration, the decoys 
can outcompete viral interaction with the cell. 
SARS-CoV-2 invades host cells in a two-step 
process (J8-20). The S protein RBD attaches to 
the cell by binding to hACE2, a membrane- 
associated protein, triggering protease-mediated. 
fusion with the cell membrane (27). The process 
is Similar to the beta-coronaviruses HCoV-NL63 
and SARS-CoV-1, which also target hACE2 for 
cellular entry (22). In principle, inhibiting the 
viral interaction with hACE2 should prevent 
infection. We applied our design strategy to 
engineer, validate, and optimize de novo hACE2 
decoys to neutralize SARS-CoV-2 infection (Fig. 
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1J and fig. S1). The design of the decoys started 
by identifying the structural motifs that form 
the hACE2 binding interface with the SARS- 
CoV-2 RBD. We based our effort on three pub- 
licly available structures of hACE2 in complex 
with the RDB of the S protein for SARS-CoV-1 
(PDB: 6CS2) and SARS-CoV-2 (PDBs: 6VW1 
and 6M17) (23-25). Four discontiguous bind- 
ing elements were identified (Fig. 1A) and the 
three largest interacting motifs were selected 
to build the de novo decoys: two long alpha 
helices (H1 and H2) and a short beta hairpin 
(EE3) (Fig. 1A and fig. S2). To generate mol- 
ecules that are biologically inert for humans, 
our computational design strategy avoided 
incorporating elements of hACE2 that are 
known (or predicted) to be biologically active, 
such as the catalytic site. Inspired by recent 
developments in the design of de novo struc- 
tural elements (26-29), we built new disem- 
bodied de novo secondary structure elements 
tailored to support the target structural ele- 
ments in a way that is both compatible with 
globular folding and would stabilize the bind- 
ing interface (Fig. 1B and materials and meth- 
ods). Then, in a strategy similar to the design 
of Neoleukin-2/15 (Neo-2/15) (26, 30), a com- 
binatorial design approach based on Rosetta’s 
“protein_mimic_designer” was used to gener- 
ate multiple fully connected protein topologies 
containing all of the desired structural and 
binding elements (26). The design of the pro- 
tein decoys was constrained to fully preserve 
(intact up to each amino acid’s conformation) 
the target binding interface (Fig. 1, A and B, 
and fig. S2) so that the de novo proteins would 
be resilient to viral mutational escape. Rosetta 
(31) was then used to generate amino acid se- 
quences predicted to fold into the target struc- 
tures, and the designs were evaluated with an 
automatic filtering pipeline based on nine 
computational parameters, including predic- 
tions of smooth folding funnels into a stable 
native state (Fig. 1, C and D) (32). 
Approximately 35,000 computational ACE2 
decoys were generated, and the top-ranking 
196 designs (see the materials and methods) 
were selected for experimental testing for bind- 
ing to SARS-CoV-2 RBD using yeast display 
(Fig. LE). With no further optimization, the de- 
sign CTC-445 showed strong (nanomolar) and 
specific binding for SARS-CoV-2 RBD (Fig. 1E, 
fig. S3, and materials and methods). CTC-445 
is a 160-amino acid protein comprising 18 
of the natural amino acids; it does not contain 
cysteine or tryptophan residues. It exhibited 
~10-fold weaker binding affinity for SARS-CoV-2 
than did hACE2 [disassociation constant (Kp) ~ 
357 nM, Kp ~ 31 nM, respectively; table S1] and, 
as a result, CTC-445 was a weak competitor of 
SARS-CoV-2 RBD binding to hACE2 [median 
inhibitory concentration (ICs9 @ hacE2fo4nM]) = 
1.7 uM; Fig. 11). We determined that low po- 
tency of CTC-445 was due to a certain degree 
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A Structure of SARS-CoV-2 RBD 
in complex with ACE2 

































Computational design of de novo ACE2 decoys 
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Day 1 (B) 
De novo computational 
design begins 


Day 21 (E) 
CTC-445 
identified 


Fig. 1. Design and characterization of de novo ACE2 decoys. (A) ACE2 
(gray) and its binding motifs (H1 19-52, orange; H2 55-84, green; EE3 346-360, 
blue) in complex with SARS-CoV-2 RBD (pink). Three starting structures were 
simultaneously used as targets (see main text); 6VW1 is shown. (B) De novo 
secondary structure elements (magenta) were computationally generated to 
stabilize H1, H2, and EE3. Seven combinations of secondary structure elements 
were considered. Circles are a-helices, triangles are B-sheets, filled circles are 
helices oriented forward, and empty circles are helices oriented backward. 

We used Rosetta to generate fully connected backbones (using the “protein_ 
mimic_designer” algorithm) and amino acid sequences predicted to fold into the 
target structure. In all cases, the binding interface of ACE2 with the SARS-CoV-2 
RBD was preserved intact (see the materials and methods). (C) Automatic 
computational filtering based on eight metrics selected the best candidates. The 
RMSD of the binding motifs to ACE2 was also used as a quality check. The 
dots indicate the mean computational score for each design scored against 
the three target RBD structures. Designs selected for experimental testing are 
shown in black. Our best design, CTC-445, is shown in red. The blue boxes 
indicate the filtering thresholds (see the materials and methods). (D) Designs 
that passed filtering were subjected to biased forward folding simulations (see 
the materials and methods), here shown for CTC-445, including the unsalted 
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Day 33 (G) 
Directed evolution 
of CTC-445 initiated 













Day 67 (H) 
Inhibition of SARS-CoV-2 RBD binding to 
ACE2 by (monovalent) CTC-445.2 


biased simulation (brown), the native-salted simulation (orange), and relaxation 
(blue). (E) The top 196 designs were selected for yeast display screening using a 
combination of Rosetta score per residue, the ddG Rosetta filter, and the folding 
simulations (see the materials and methods). The designs were individually 
assessed for specific binding to SARS-CoV-2 spike RBD (Fc fusion, 200 nM). The 
plot for CTC-445 is shown. (F) CTC-445 was recombinantly expressed and purified 
by affinity chromatography (see the materials and methods). Analytical size 
exclusion chromatography (SEC) for CTC-445 revealed the presence of oligomeric 
species. (G and H) CTC-445 was optimized by directed evolution and rational 
combination of the observed favorable mutations (G), leading to CTC-445.2 (SEC), 
which is mainly monomeric in solution (H) and ~1000x more potent to compete 
with ACE2 than its parent [see (G)]. We further optimized the potency of our 
molecule by generating a bivalent version named CTC-445.2d. (I) Potency of designs 
to outcompete binding of SARS-CoV-2 RBD to ACE2, as measured by competition 
enzyme-linked immunosorbent assay (ELISA) using a constant concentration of 
0.4 nM ACE2. (J) Timeline of the de novo protein design and optimization pipeline. 
Timewise, green indicates phases that we believe were performed optimally, 

red indicates those that can potentially be avoided in future efforts, and yellow 
indicates phases that can potentially be expedited by using more advanced and/or 
automated methods for gene synthesis, cloning, and high-throughput screening. 
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of instability of its folded state [free energy dif- 
ference between folded and intermediate states 
(AG) ~-2.7 kcal mol”, melting transition tem- 
perature (T,,) ~75.3°C; Figs. 1F and 2B and fig. 
S5]. A single round of directed evolution to im- 
prove stability and binding affinity, and sub- 
sequently the rational combination of the five 
most frequent observed mutations (none of 
them in the binding interface), led to the pro- 
tein decoy CTC-445.2 (Fig. 1G, figs. S6 and S7, 
table S2, and materials and methods). CTC- 
445.2 is predominantly monomeric (Fig. 1H 
and fig. S8), thermodynamically hyperstable 
(AGyx ~-5.0 kcal mol, Tm ~93°C; Fig. 2B and 
fig. S5), exhibits low nanomolar affinity for the 
RBD of SARS-CoV-2 (Kp ~21.0 nM; table SI), 
has improved cross-reactivity to SARS-CoV-1 
(Kp ~7.1 uM; Fig. 2C and table S1), and can 
efficiently compete hACE2 binding to the SARS- 
CoV-2 RBD (ICs @ hACE2[0.4nM] ~10.4 nM; Fig. 
11). The amino acid sequence of CTC-445.2 
has little identity with hACE2, in terms of 


either linear or structurally aligned sequence 
(ClustalW ~22%, MICAN ~ 4%, respectively; 
fig. S9). Serial duplication (i.e., increase in 
avidity) of CTC-445.2 led to higher-potency 
molecules with favorable biochemical prop- 
erties. For example, CTC-445.2d (Fig. 2A), a 
bivalent version of CTC-445.2, had an ~10-fold 
improvement in binding affinity for both SARS- 
CoV-2 RBD (Kp ~3.5 nM; table S1) and SARS- 
CoV-1 RBD (Kp ~587 nM; Fig. 2C and table 
S1), and a similar increase in its ability to 
compete with hACE2 binding to SARS-CoV-2 
RBD (IC5o @ nacr2[o.4nm] ~700 pM; Fig. 11). A 
trivalent version of CTC-445.2 resulted in 
even higher (picomolar) binding affinity and 
a matching hACE2 competition potency (Kp 
~270 pM, ICso @ nack2[0.4nm) ~10 pM; fig. S10 
and table S1). In a cross-reactivity binding assay 
containing >21,000 human proteins, we con- 
firmed that CTC-445.2d bound to the SARS- 
CoV-2 RBD with high selectivity (fig. S11 and 
materials and methods). 
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Fig. 2. Stability and binding of the de novo protein decoys CTC-445, CTC-445.2, and CTC-445.2d. 

(A) Design models of CTC-445, CTC-445.2, and CTC-445.2d. CTC-445.2 contains five mutations that were guided 
by directed evolution experiments. CTC-445.2d is a bivalent variant composed of two CTC-445.2 subunits 
linked by a 16-mer flexible GS linker (sequence -GGGSGGSGSGGSGGGS-). (B) Circular dichroism of 
recombinantly expressed CTC-445 (red), CTC-445.2 (blue), and CTC-445.2d (orange). Thermally induced 
melting of the decoys was followed by its circular dichroism signal at 208 nm (heating rate, 2°C/min). The inset 
Shows far ultraviolet (UV) wavelength spectra at 20°C (purple), after heating to ~95°C (brown), and after 
cooling the heated sample to 20°C (green dashed). Complete ellipticity spectra recovery (full reversibility) upon 
cooling was observed in all cases. Calculated 7,, values for CTC-445, CTC-445.2, and CTC-445.2d are 75.3 + 
0.2°C, =93°C, and 71.7.4 0.2°C, respectively. (C) Binding was assessed using biolayer interferometry 
(OCTET) binding assays of CTC-445, CTC-445.2, and CTC-445.2d against immobilized SARS-CoV-2 RBD 
(top) or SARS-CoV-1 RBD (bottom) (see table S1). The model fitting is shown with dotted black lines. 
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Single-particle cryo-EM structures of CTC- 
445.2 in complex with the SARS-CoV-2 S trimer 
showed that the de novo decoy is capable of 
simultaneous binding to all three RBDs of 
the SARS-CoV-2 trimeric S protein, both in the 
“up” and “partially down” RBD conformations 
(Fig. 3, A to D, and fig. S12). To accurately 
model the CTC-445.2-RBD interactions, we 
used focused classification and local refine- 
ment on the subset of particles that showed 
CTC-445.2 bound to a partially down RBD, 
which yielded a 4.1-A map with improved 
CTC-RBD features relative to CTC-RBD regions 
on the up RBDs (Fig. 3, A to D, and figs. S12 
and S13). The computationally derived model 
of CTC-445.2 closely matched the cryo-EM- 
determined structure [Co root mean square 
deviation (RMSD) = 1.1 A], with minor differ- 
ences observed in the N-terminal EE3 and H2 
helix (Fig. 3, E to H). As designed, the binding 
interface of the SARS-CoV-2 RBD with CTC- 
445.2 closely mirrored the target hACE2 in- 
terface. We used site saturation mutagenesis 
(SSM; see the materials and methods) (33, 34) 
to explore the effect of single-amino acid sub- 
stitutions in CTC-445.2 on its binding to the 
SARS-CoV-2 RBD (Fig. 3, I and J). The exper- 
iment showed that mutations in the core of the 
design are disallowed, and mutations in sur- 
face or exposed residues are generally tolerated 
(Fig. 3, I and J). The SSM experiment also re- 
vealed that there is room to further improve 
the affinity of the protein by introducing muta- 
tions in the binding interface (Fig. 3D, although 
doing so would break the hACE2 structural 
mirroring of the de novo decoy. 

We also performed an SSM experiment for 
the SARS-CoV-2 RBD binding interface to com- 
pare the effect of single-amino acid substitution 
on binding to hACE2 or CTC-445.2. As predicted, 
the effects of ~1700 SARS-CoV-2 RBD mutations 
showed a strong correlation between binding to 
hACE2 and CTC-445.2 (R? = 0.84, Pearson’s r = 
0.92; Fig. 4 and fig. S14), highlighting the decoy’s 
intrinsic resiliency to mutational escape. At 
low target concentrations (100 pM), CTC-445.2 
had a large binding advantage over ACE2 for 
many of the RBD mutations (fig. S14), likely a 
result of both its higher stability and smaller 
size. Although CTC-445.2 was resilient to viral 
mutations in the RBD-binding interface, we 
observed some decoy-binding-weakening mu- 
tations that had a lesser effect on hACE2 bind- 
ing. Therefore, viral mutational escape might 
still be possible if multiple (decoy-binding- 
weakening) RBD mutations are combined. 

The high and specific binding affinity of the 
optimized de novo protein decoys translated 
into effective and specific in vitro neutralization 
of SARS-CoV-2 viral infection (Fig. 5). In vitro, 
the presence of the de novo decoys had no 
impact on mammalian cell viability (Fig. 5A 
and fig. S15) or the enzymatic activity of hACE2 
(fig. S16). Both of the decoys were able to fully 
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Fig. 3. Cryo-EM structure of the CTC-445.2-S complex. (A to D) Cryo-EM 
reconstructions of CTC-445.2 (blue) bound to soluble spike trimers (gray). 3D 
classification revealed four distinct classes: one CTC-445.2 bound to an “up” RBD (A), 
two CTC-445.2 bound to two “up” RBDs (B), two CTC-445.2 bound to one “up” and one 
“down” RBD (C), and three CTC-445.2 bound to two “up” and one “down” RBD (D). 
(E) Overlay of CTC-445.2-RBD computationally modeled (yellow) and experimentally 
determined using cryo-EM (blue). The Ca RMSD between the design model and the 
refined experimental structure is 1.1 A. (F to H) Comparison of cryo-EM CTC-445.2 (blue), 
computationally modeled CTC-445.2 (yellow), and hACE2 (green) at the interface of 

the RBD (gray). (I) Deep mutational scanning heatmap showing the average effect on the 
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enrichment for single site mutants of CTC-445.2 when assayed by yeast display for 
binding to the SARS-CoV-2 RBD (binding assayed at RBD concentrations of 100, 50, 25, 
12.5, 6.25, 3.125, and 1.5625 pM; see the materials and methods). (J) Design model of 
CTC-445.2 colored by average enrichment at each residue position [from the data 
in (1)] bound to SARS-CoV-2 RBD (gray). As expected, mutations in the core of 

the design or to positions involved in binding to the RBD are generally disallowed. 
The deep mutational scanning revealed that there is still room to further improve the 
binding affinity of CTC-445.2, including mutations in the binding interface that in 
principle could afford higher potency and selectivity at the cost of compromising the 
decoy’s mutational escape resiliency (see Fig. 4). 
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Fig. 4. Resilience of CTC-445.2 to SARS-CoV-2 
RBD mutational escape. (A) Deep mutational 
scanning (DMS) of the SARS-CoV-2 RBD 
interface was performed to assess the effect 
on binding (by yeast display) to CTC-445.2 
(top) or hACE2 (bottom) at eight different 
concentrations (656, 218, 72, 24, 8, 2, 0.3, and 
0.1 nM; fig. S16 and materials and methods). 
The heatmaps indicate the effect on binding for 
each possible single amino acid mutation in 
the hACE2-binding interface of the RBD (see 
the materials and methods). The results are the 
average over all the concentrations tested. 

A black square represents lack of expression in 
the naive (unselected) library. The color bars at 
the bottom indicate the secondary structure 
element with which a given RBD residue 
interacts: H1, orange; H2, green; EE3, blue; 
and H4, magenta. Approximately 1700 single 
mutations were targeted by the experiment. 
(B) The SARS-CoV-2 RBD surface is colored 
according to the per-residue-averaged 
enrichments for binding to CTC-445.2 

(top) or hACE2 (bottom). For reference, 

the structure of CTC-445.2 or ACE2 (respec- 
tively) is shown in semitransparent gray 
cartoons. (C) The 2D scatter plots compare 
the enrichment values [as in (A)] for the DMS of 
the RBD binding to CTC-445.2 (y-axis) versus 
hACE2 (x-axis). There is a high correlation 
between the effect of RBD mutations in the 
binding of both molecules, demonstrating the 
mutational resilience of the de novo decoy 
(Pearson's r = 0.92). 


neutralize viral infection in in vitro systems of 
cell infection. Briefly, in a vesicular stomatitis 
virus (VSV) pseudovirus system expressing the 
SARS-CoV-2 S protein, the decoys specifically 
protected human embryonic kidney (HEK) 
293T cells overexpressing hACE2 from infec- 
tion (fig. S15). The decoys also were able to 
fully neutralize infection by SARS-CoV-2 
(SARS-CoV-2 nanoLuc; see the materials and 
methods) in the lung epithelial cell line Calu-3 
expressing both ACE2 and the transmem- 
brane protease serine 2 (TMPRSS2) (35, 36) 
[median effective concentration <5 nM ata 
multiplicity of infection (MOI) of 1.0; Fig. 5A]. 
In an in vitro time-of-addition assay using the 
Vero E6 cell line, CTC-445.2 and CTC-445.2d 
were most effective at neutralizing SARS-CoV- 
2 infection when continuously present in the 
cell media throughout the full course of infec- 
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the viral challenge afforded 100% survival from 
a lethal SARS-CoV-2 challenge (5 x 10° plaque- 
forming units of SARS-CoV-2; Fig. 5C). Specif- 
ically, by day 7, all control animals that received 
the viral challenge but not CTC-445.2d (n = 7) 
exhibited severe distress and required eutha- 
nasia. By contrast, hamsters that received a 
single dose of CTC-445.2d 12 hours before 
challenge all survived (n = 8), with modest 
weight loss and few or no clinical signs of dis- 
tress (Fig. 5C and table S5). 

Our de novo protein design approach to 
generate decoys is orthogonal to traditional 
therapeutics and has the potential to better 
overcome mutational viral evasion. Natural 
proteins repurposed often present substantial 
challenges for development as therapeutics; 
these include low stability, which can compli- 
cate manufacturing, transport, and storage; 


tion (as opposed to only before or after infec- 
tion; Fig. 5A and figs. S16 to S18), confirming 
that their mechanism of viral inhibition is 
extracellular neutralization of the virus. 

To determine the potential of our molecules 
to be used as respiratory-delivered therapeu- 
tics, we intranasally administered a single dose 
of CT'C-445.2d to Balb/c mice (100 ug dose of 
CTC-445.2d in a 30-uL droplet) and observed 
the presence of the fully functional decoy for 
>24 hours in the lungs and respiratory tract 
of mice (Fig. 5B and fig. S19). A 14-day course 
of daily CTC-445.2d intranasal administra- 
tion in mice (100 ug of CTC-445.2d in a 30-uL 
droplet) was well tolerated, causing no adverse 
effects (Fig. 5B). In a Syrian hamster model for 
SARS-CoV-2 infection, a single prophylactic intra- 
nasal dose of CT'C-445.2d (560 ug of CTC-445.2d 
in a 100-uL droplet) administered 12 hours before 
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Fig. 5. In vitro virus neutralization by CTC-445.2d. 
(A) Top left: In vitro neutralization of NanoLuc 
SARS-CoV-2 by CTC-445.2d in Calu-3 cells after 

72 hours of incubation at a MOI of 1.0. Top right: A 
cell viability assay (48 hours) confirmed that the 
decoys are not cytotoxic to Calu-3. Bottom 

left: In vitro neutralization of live BetaCoV/Hong 
Kong/VM20001061/2020 SARS-CoV-2 virus in Vero 
E6 cells at a MOI of 1.0. The cells were incubated 
with CTC-445.2d throughout infection and the colors 
indicate the following: orange, before infection, 
during infection, and after infection; black, after 
infection only; and gray, before infection only. 
SARS-CoV-2 RNA copy numbers were determined 
by quantitative real-time reverse transcription 
polymerase chain reaction. All assays were per- 
formed in triplicate unless otherwise noted, and all 
data points are shown. Bottom right: Cell viability in 
Vero E6 cells was independently performed (CCK8 
assay) and it was confirmed that the de novo decoys 
are not cytotoxic. (B) In vivo mouse pharmacokinetics 
and tolerability of intranasally administered CTC- 
445.2d. Left: Plot showing the concentration of 

fully functional CTC-445.2d (i.e., capable of binding 

to the SARS-CoV-2 RBD; see the materials and 
methods) found in homogenized lungs of Balb/c 

mice after a single 100 ug dose, measured at various 
times after dosing (n = 5 mice). Right: Body weight of 
mice after repeat daily intranasal doses of CTC- 
445.2d (100 ug; n = 18 at day 0) compared with 
control [phosphate-buffered saline (PBS)-treated] 
mice (n = 5). At each time point, three CTC-445.2d- 
treated mice were sacrificed for lung examination. 
Weight data shown are for the remaining mice (n = 
18, 15, 12, 9, 6, and 3 at days 1, 2, 4, 8, 11, and 14, 
respectively). No significant weight loss or lung 


abnormalities were observed. Error bars indicate the standard deviation. (C) In 
vivo Syrian hamster SARS-CoV-2 challenge. Left: Body weight measurements 
through day 10 for unchallenged hamsters (n = 5, red) compared with SARS-CoV-2- 
challenged hamsters treated either with a single dose of CTC-445.2d (day O 

at -12 hours; n = 8, orange) or PBS (day -1, day O at -12 hours, day 1, and day 2; 
n= 7, gray). Right: Survival plot. Hamsters were euthanized when they displayed 


residual (and undesirable) biological activ- 
ity; and the risk of eliciting an autoimmune 
response (37-46). By contrast, the de novo 
protein decoys are amenable for large-scale 
manufacturing in traditional bacterial sys- 
tems, and their thermodynamic hyperstability 
can enable simplified transport and storage. 
Other recent protein-engineering efforts have 
generated neutralizing proteins character- 
ized by extremely high binding affinities for 
SARS-CoV-2, with Kps ranging from low nano- 
molar to femtomolar [e.g., mAb 2B04 (47); 
LCBI1 (6); and the nanobody Nb6 (48)]. Never- 
theless, the de novo decoy’s resilience to viral 
escape is a distinctive feature of our design 
strategy (Fig. 4 and figs. S14 and S20). A pos- 
sible shortcoming is that a decoy’s require- 
ment to replicate a natural binding interface 
can intrinsically limit the maximum binding 
affinity attainable. However, we have demon- 
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(see table Sd). 


strated that the binding affinity (and potency) 
of the de novo decoys can be increased both 
by further sequence optimization (e.g., CTC- 
445.3d; fig. S21) or through avidity, allowing 
our trivalent decoy CTC-445.2t to reach the 
picomolar affinity range (Fig. 3I and fig. S10). 
It is possible that avid versions of CTC-445.2 
coupled with more refined linkers (rigid and 
with proper spacing for binding simultaneously 
to multiple RBD subunits) might lead to larger 
increases in binding potency. 

We demonstrate rapid design of a therapeu- 
tic lead; further speed improvements to our 
pipeline are theoretically attainable, for exam- 
ple by using high-throughput experiments to 
rapidly select and optimize the designs (Fig. 1G). 
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Mechanisms of collision recovery in flying beetles 


and flapping-wing robots 


Hoang Vu Phan’2* and Hoon Cheol Park'2* 


At rest, beetles fold and tuck their hindwings under the elytra. For flight, the hindwings are deployed 
through a series of unfolding configurations that are passively driven by flapping forces. The 

folds lock into place as the wing fully unfolds and thereafter operates as a flat membrane to 
generate the aerodynamic forces. We show that in the rhinoceros beetle (Allomyrina dichotoma), 
these origami-like folds serve a crucial shock-absorbing function during in-flight wing collisions. 
When the wing collides with an object, it collapses along the folds and springs back in place within 
a single stroke. Collisions are thus dampened, helping the beetle to promptly recover the flight. 
We implemented this mechanism on a beetle-inspired wing on a flapping-wing robot, thereby 


enabling it to fly safely after collisions. 


ost flying insects live in cluttered forest 
environments that are covered by dense 
vegetation, tree branches, and bushes. 
Although insects are capable of obsta- 
cle avoidance using their sensory sys- 
tems (J, 2) in such complex conditions, obstacle 
collisions of the wings of insects are more likely 
and may affect their flight performance or even 
cause wing damage (3). Because of their vul- 
nerable wings, insects use biomechanical strat- 
egies to protect them (4, 5). When at rest, the 
wings of most insects are able to fold along 
the body to reduce the risk of damage (6). In 





Fig. 1. Beetles flap their hindwings using 
wing inertial effect to unfold. (A) Detail of 
the hindwing and folding patterns. RA, radius 
anterior; BZ, bending zone; MJ, marginal joint; 
MP, media posterior; FJ, flexible joint; RP, 
radius posterior; +, convex fold; -, concave fold. 
The orange-shaded areas denote the reverse- 
folding area. Scale bar, 1 cm. (B to D) Time 
history of the data during the hindwing 
unfolding process driven by the beetle (black) 
and a flapping mechanism (red): (B) fold angle 
(5); (C) stroke angle (); (D) stroke accelera- 
tion (). Lines, shaded areas, and shaded 
columns denote the average, +SD, and the 
upstroke motion, respectively. (E and F) Fold 
angles in the first five wing beats in various 
unfolding tests of intact wing (IW), clipped 
flexible joint (CFJ), clipped membrane 
surface (CMS), and motor-driven wing (MDW) 
during (E) downstroke and (F) upstroke 
motions. Error bars represent +SD. 

(G) The apical field is unable to be locked 

in its folded configuration and unfolds quickly 
to outspread shape. (H) Torque-deflection 
relationship of the RA3 (wing tip) while 
folding creases are tightened. (I) Stored 
energy with respect to the deflection angle 
enabling the self-unfolding of the wing tip. 
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particular, beetles (Coleoptera) use a pair of 
hardened forewings (elytra) to protect their 
flexible hindwings and body. Most beetles are 
capable of performing many modes of loco- 
motion, such as flying, crawling, and even bur- 
rowing their way underground. Such abilities 
are supported by a hard exoskeleton covering 
the body and the ability of foldable hindwings. 
To generate lift, the hindwings are often larger 
and longer than the body and elytra and must 
be therefore folded so they can be tucked un- 
der the hard protective elytra. Much progress 
has been made in addressing questions on the 
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structures and operations of the folding mech- 
anism, when the beetles are at rest (7-17). Most 
birds and bats are capable of partial wing- 
folding during the upstroke and full wing 
extension during the downstroke to create 
efficient flight (72, 13). In particular, pigeons 
were found to use a wing-folded posture when 
traversing confined spaces (14). By contrast, 
beetles rarely use the folding mechanism dur- 
ing their normal flight locomotion, because 
beetles (and most insects) cannot actively 
flex their wings, as compared with birds 
and bats. 

To explore the use of folding mechanism dur- 
ing beetle flight, we performed experiments 
and analyses of free-flying rhinoceros beetles 
[Allomyrina dichotoma (fig. S1)] under colli- 
sion impact on the hindwing, mimicking the 
challenges that they face when flying in clut- 
tered habitats. We first investigated how this 
beetle unfolds its hindwings for flight. Typi- 
cally, two unfolding patterns of the hindwing 
are observed in most beetles: (i) unfolding com- 
pletely before flapping [examples: earwigs 
Forficula auricularia (10) and ladybird beetles 
Coccinella septempunctata (1D] and (ii) flap- 
ping before unfolding [examples: sun beetles 
Pachnoda marginata (9) and stag beetles 
Dorcus titanus platymelus (15)]. In the first 
pattern, the unfolding is driven by stored elastic 
energy in the wing (7, 10). Beetles with the 
second unfolding pattern actively use their 
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thoracic muscles (6, 9) or hydraulic mecha- 
nism (1/5) to unfold. The rhinoceros beetle 
uses the second pattern by first opening its 
elytra to promote the hindwings at the wing 
base while retaining the same folded con- 
figuration at the marginal joint (MJ; Fig. 1A). 
The wing starts to flap and unfolds completely 
within two wing beats (movie S1). The unfold- 
ing process occurs often during the upstroke 
and reaches fully extended configuration at 
the dorsal stroke reversal, at which the wing 
angular acceleration reaches its peak (Fig. 1, 
B to D). Thus, we hypothesized that the un- 
folding is aided by flapping forces, i.e., inertial 
and aerodynamic forces during fast flapping 
speed (fig. S2), rather than from active tho- 
racic muscular control. 

To address the hypothesis, we cut off the 
hindwing of the beetle at the wing base and 
immediately attached it to a flapping mech- 
anism (fig. S3). By actuating the hindwing at 
a similar frequency to that operated by the 
beetle (38 Hz), we observe that the hindwing 
is successfully unfolded at the MJ to deploy 
the apical field, even though the radius ante- 
rior 3 (RA3) was not fully retained in the out- 
stretched configuration during the upstroke 
(Fig. 1, B to F, and movie S1). To keep the 
apical field in the outstretched shape, the 
flexible joint (FJ) connecting the radius pos- 
terior (RP) and RA1 is a necessary component 
(Fig. 1, E and F; fig. S4; and movie S2). The 
flapping forces trigger the folding-hand-fan- 
like movement of the media posterior 1 and 2 
(MP1+2), RP, and RAI to tighten the FJ and 
the diamond-shaped creases at the MJ (figs. 
S5 and S6). As a result, the RA3 can be un- 
folded to fully deploy the apical field. Without 
the membrane, the hindwing requires more 
flaps to unfold (fig. S4 and movie S2), suggest- 
ing that aerodynamic forces also play a role 
in the unfolding. 

By tightening the FJ and folding creases in 
the median field, we found that the wing tip 
is unable to lock in the fold configuration and 
springs back to an outstretched shape (Fig. 
1G and movie S3). In the outspread wing, 
the measured torque-deflection response of 
the MJ shows that the torque increases and 
remains maximum (locking torque) as the 
deflection angle increases (Fig. 1H). Subse- 
quently, the torque declines at high deflec- 
tion angles (>40°). During folding, energy is 
stored, enabling the self-unfolding of the 
wing tip (Fig. 11). These observations sug- 
gest that the foldability allows wings to act 
as shock absorbers during in-flight colli- 
sions. They also explain how the wing main- 
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tains its outspread shape during flapping 
motions. After flight, the hindwings can be 
folded and tucked under the elytra. The fold 
can be triggered by the abdominal movement 
to push the RP3+4 for its transverse fold, re- 
sulting in a locked configuration of the creases 
(movie S4) (7, 11). 

We conducted experiments in which beetles 
flew past and often collided with fixed vertical 
posts made of carbon tubes (fig. S7). We ob- 
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served two kinds of collisions: mid-wing colli- 
sion (from the wing base to the MJ) and wing-tip 
collision (from the MJ to the wing tip). Because 
the collisions occur randomly during free flights, 
we counted the collision patterns and flight 
behaviors after collision (table S1 and fig. S8). 
In each collision pattern, we classified three 
groups of behaviors: stable flight, tumble, and 
perching. Among a total 127 flight trials of 
8 individuals (14.9 + 6.6 trials per individual, 
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Fig. 2. Beetles survive in-flight wing collision by collapsible wings. (A and B) Flight behavior after wing 
collisions: (A) mid-wing collision and (B) wing-tip collision (median, interquartiles, and range). P, perching; 
T, tumble; F, stable flight. ***P < 0.001 (two-sample, two-tailed t test). (C to E) Sequential sketches of 
representative colliding flights with different behaviors after collision: (C) perching, (D) tumble, and (E) stable 
flight. (F) Fold angles of the collided wings in (D) (black) and (E) (green). (G to Q) Flight trajectories, 
body attitudes, and body rates of the beetles during collision flights with presence (green) and absence 
(black) of the folding function. (G) Flight trajectories (Z) and (H) changes in downward velocity (Aug). 

(I to K) Time course of the body attitude angles: (I) roll (Ag), (J) pitch (A@), and (K) yaw (Ay). These angles 
were biased from initial state and adjusted corresponding to the right wing collisions. (L to N) Maximum 
deviation between the attitude angles before and after the collision: (L) roll (|A@|max), (M) pitch (|A®|max), 
and (N) yaw (|Aw| max). **P < 0.01, ***P < 0.001. (O to Q) Time course of the body angular rates. Thin lines 
indicate individual flight trials and thick lines represent median values. The time instant when the wing 


first hits the post is set at O ms. 
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Fig. 3. A mechanical wing can be passively folded and speedily unfolded 
adapting the beetle’s hindwing. (A) Implementation of the folding wings on the 
17.8-g tailless flapping-wing robot. (B) Folding wing in fold configurations: 
longitudinal fold (top) and transverse fold (bottom). Scale bar, 1 cm. (C) Torque 
required to fold the wing tip at the BJ in longitudinal (dark blue) and transverse 
(black) folds. Yellow color denotes the longitudinal fold behavior without the 
reinforced linkage (RV3 in fig. S17). (D) Stored energy during folding. (E) Self- 
unfolding response of the wing tip. Line and shaded area denote the average + SD. 


(F) Image sequence of the colliding flapping motion in one flapping cycle. 
(G to J) Time course of the data in the colliding flapping motion: (G) stroke angle 
(o); (H) fold angle (8); (1) wing rotational angle (a,) at 75% wingspan; (J) yaw 
torque (M-) produced during wing-tip collision. Red-dashed frame denotes 

the collision period. Shaded column denotes the upstroke motion. (K) Yaw torque 
production in wing collision without folding mechanism. (L) Heading angle and 
(M) rate of the 17.8-g robot in wing-tip collision trials with (orange) and without 
(black) folding mechanism. 


mean + SD), 58 were “mid-wing collision” and 
69 were “wing-tip collision” (Fig. 2). The bee- 
tles approached the posts at a mean flight 
speed of 0.59 + 0.12 m/s per flight trial (fig. 
S9). In “mid-wing collision,” the beetles used 
their outspread legs to catch the obstacle for 
perching (movie S5), followed by “tumble” 
events (movie S6), and only a few “stable flight” 
events occurred. During flight, the legs were 
outspread with their lengths covering most of 
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the RA length (fig. S10). Thus, if the obstacle 
approached the RA, the beetles were able to 
use either front legs or middle legs with hooks 
at the tip to catch the obstacle. By contrast, in 
“wing-tip collision,” the beetles were mostly 
able to recover stable flight (movie S7). A few 
“wing-tip collision” cases caused the beetles 
to tumble, but none of the beetles used their 
legs to perch. Most of the “stable flight” events 
were assisted by the folding mechanism, which 
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is passively folded in longitudinal pattern at 
the MJ because of the collision (Fig. 2F). Be- 
fore flapping, the folding plane was parallel 
to the body and approximately normal to 
the stroke plane (fig. S10). However, during 
flapping, the RA twisted near the wing base 
to rotate the plane of folding at the stroke 
reversals, enabling the folding plane to ap- 
proach the stroke plane. The change of fold- 
ing plane allows the MJ (and the wing) to 
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avoid critical collision damage caused by out- 
of-plane deformation. 

To investigate the role of the folding mech- 
anism on the flight of the beetle, we classified 
the collisions into two events, “collision with- 
out folding” found in both mid-wing and wing- 
tip collisions and “collision with folding” in 
wing-tip collision, and analyzed the flight paths 
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and body attitude angles [DLTdv digitizing 
tool (16); Fig. 2, G to Q, and fig. S11]. In the 
“collision without folding” events, the wing 
hit the post 4.75 + 1.45 times per flight trial 
(fig. S9). The post disrupted the flapping wing 
motion (mostly during downstroke), causing 
substantial changes in the flight path and body 
attitude. As a result, the beetle lost altitude and 
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Fig. 4. A beetle-inspired, hover-capable, flapping-wing robot can fly with foldable wings after sustaining 
wing-tip collision. (A to C) Free hovering flight performance of the robot: (A) composite images in 

time sequences of takeoff and hovering flight (scale bar, 20 cm); (B) three-dimensional flight trajectory; 
(C) attitude angles. Ref., reference. (D to J) Remotely piloted flight demonstrations with wing collision. 

(D and E) Composite side-view images of the robot in time sequences of wing-collision flights without (D), 

and with (Ei) wing-folding function (scale bar, 10 cm). (Eii) Bird's-eye view snapshot images of the robot in (Ei) 
during upstroke (3 ms) and downstroke (20 ms) wing-tip collisions. (F) Corresponding flight trajectory, (G) altitude 
versus time, and body attitude angles [(H) roll, (I) pitch, and (J) yaw] of the robot with the presence (green) 
and absence (black) of the folding mechanism. The wing starts to hit the obstacle at time t = O ms. 
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often tumbled. However, in the “collision with 
folding” events with 5.47 + 3.12 collisions per 
trial, the flight paths and body kinematics were 
less perturbed than those in the “collision with- 
out folding” events. When the wing tip hit the 
obstacle, it collapsed along the folds. After 
passing the obstacle, the wing tip returned to 
its outstretched configuration in less than 4 ms 
within the stroke (12.5 ms at 40 Hz) owing to 
the tightened creases, and hence, the collision 
occurred during both downstroke and upstroke 
motions. Thus, the folding mechanism absorbed 
the reaction force of the obstacle affecting the 
hindwing without disrupting flapping motion. 

We analyzed the wing kinematics of the left 
and right hindwings in a representative “colli- 
sion with folding” flight trial (Fig. 2E and figs. 
S12 to S15). We found that collisions altered 
the deviation and reduced the stroke ampli- 
tude and frequency in the collided wing, caus- 
ing asymmetric force generation of the two 
wings and changes in the body attitude. To 
stabilize, the beetle immediately increased the 
stroke amplitude of the right-collided wing 
while reducing that of the left wing. Using a 
quasi-steady blade element model (17) (see sup- 
plementary materials), we showed that al- 
terations in the stroke amplitudes produced 
unequal vertical forces in the two wings, which 
predominantly contribute to roll-torque gener- 
ation (figs. S12 to S15 and table S2). However, 
this alteration caused reduction of the lift 
force, resulting in loss of flight altitude. Other 
components were not considerably affected 
by the alteration. Although the cycle-average 
yaw torque or flapping yaw counter-torque (8) 
is negligible, we observed yaw angle change in 
the recovery phase. This yaw motion could be 
generated from the coupling result of the roll 
torque and forward motion (19). Thus, the beetle 
can actively alter stroke amplitudes of the two 
wings, creating compensating torques to re- 
cover flight stability. In the “collision with- 
out folding” events, despite increasing the 
stroke amplitude in the collided wing after 
collision, the beetles could not significantly 
recover the flight, because of high-collision 
impacts (fig. S16). 

These results show that the rhinoceros bee- 
tle leverages folding mechanism to mitigate the 
impact of wing-tip collision on its flight per- 
formance. This mechanism is appealing for 
application in small flying robots, in particu- 
lar, bioinspired flapping-wing robots (20, 21). 
Many insect-inspired flying robots have been 
developed (22-26), with the aim of use in con- 
fined space and hazardous environments. 
To mitigate the risk of in-flight collisions, 
approaches using folding frames (27) or pro- 
tective systems (28, 29) have been considered 
in rotary-wing drones. However, these concepts 
are not applicable on flapping-wing robots, as 
they are either unable to support flight recovery 
or heavy and cumbersome. 
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Adapting the beetle’s hindwing, we built 
a passively foldable wing on a 17.8-g beetle- 
inspired flying robot (Fig. 3A and tables S3 
and S4) (30). The wing could be passively 
folded and rapidly unfolded (within 7 ms) at 
the bending joint (BJ) located at the mid-span 
of the leading edge, which mimics the MJ of 
the beetle’s hindwing (Fig. 3, B to E; figs. S17 
and S18; and movie S8). It can perform two 
folding modes at the BJ, i.e., longitudinal fold 
and transverse fold, owing to the vein arrange- 
ment. The longitudinal fold is triggered by 
the BJ made of 0.2-mm super-elastic nickel- 
titanium alloy and the hand-fan-like fold of 
the reinforced vein (RV3 in fig. S17), which pre- 
vents undesired folds during flapping motion 
(see supplementary materials, figs. S19 and 
S20, and movie S9). In this folding mode, the 
shape of the torque-deflection curve (Fig. 3C, 
dark blue) is similar to that of the beetle’s hind- 
wing. Meanwhile, the transverse fold is formed 
by the buckling of the BJ. Energy stored during 
folding enables the rapid unfolding of the 
wing tip (Fig. 3, D and E). This folding mech- 
anism thus allows the wing tip to collapse and 
cross the obstacle, before quickly returning 
to an outstretched shape within a single flap- 
ping stroke (Fig. 3, F to J, and movie S10). To 
investigate the wing performance, we measured. 
yaw torque and response due to collision in 
the presence and absence of the folding mech- 
anism. The results showed that the folding 
mechanism substantially reduces the collision 
impacts (up to two-thirds of the yaw torque 
and yaw rate) and thus minimizes change 
in the heading direction (Fig. 3, J to M, and 
fig. S21). 

The flying robot integrated with the fold- 
able wings could generate enough vertical force 
to freely take off and hover with onboard elec- 
tronics and power source (Fig. 4, A to C; figs. 
S22 and S23; table S5; and movie S11). To 
demonstrate the use of the folding mechanism 
during flight, we conducted experiments on 
wing-collision flight of the robot with ver- 
tical posts (Fig. 4, D to J; figs. S24 to S26; 
and movies S12 to S14). The tests show that 
the folding mechanism enables the robot 
to maintain stable flight after wing-tip col- 
lision. In the absence of the folding mecha- 
nism, the robot tumbles. This confirms the 
usefulness of a passive-folding wing in miti- 
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gating in-flight wing-collision impacts. The 
tests also demonstrate the capability of the 
robot in flying through a narrow gap that is 
smaller than its wingspan, which is impossi- 
ble for conventional flying robots without a 
folding function. 

In summary, we demonstrated that the rhino- 
ceros beetle uses different biomechanical 
strategies to survive in-flight collision by passive- 
folding mechanism or by legs, depending on 
collision points. Inspired by the beetle’s hind- 
wing, we showed how the passively foldable 
wing can be implemented in a hover-capable 
flapping-wing robot to recover the flight after 
collision. The foldable wing thus presents an 
effective approach to deal with the wing colli- 
sions, helping to increase the mission dura- 
tions of the robot. However, adding the folding 
mechanism comes at a cost of reduced power 
efficiency, as it requires more inertial power to 
accelerate the wing. Any undesired deflection 
of the BJ during flapping motion also affects 
the aerodynamics and flight stability. Despite 
the trade-offs, the above findings and results 
bring us closer to biological locomotion strate- 
gies existing in nature for potential applications 
in flying robots, which can perform multiple 
locomotion modes as a current trend of bio- 
inspired robotic technology (23). 
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Long-term collapse in fruit availability threatens 
Central African forest megafauna 
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Afrotropical forests host much of the world’s remaining megafauna, although these animals are confined 
to areas where direct human influences are low. We used a rare long-term dataset of tree reproduction 
and a photographic database of forest elephants to assess food availability and body condition of an 
emblematic megafauna species at Lopé National Park, Gabon. Our analysis reveals an 81% decline in 
fruiting over a 32-year period (1986-2018) and an 11% decline in body condition of fruit-dependent 
forest elephants from 2008 to 2018. Fruit famine in one of the last strongholds for African forest 
elephants should raise concern about the ability of this species and other fruit-dependent megafauna to 
persist in the long term, with potential consequences for broader ecosystem and biosphere functioning. 


he largest plants and animals on the 
planet are disproportionately important 

for the metabolism and functioning of 

our ecosystems (J). However, they are 

also more susceptible to extinction (2), 

and “global downsizing” through loss of mega- 
fauna and megaflora is likely to have nega- 
tive consequences for the biosphere (J). The 
African tropics are home to much of the 
world’s remaining wild megafauna, but even 
here they are mainly confined to areas where 
direct human influences are low (3, 4). Secur- 
ing the future of these megafauna populations 
will depend on our ability to protect against 
human threats, such as habitat loss and hunt- 
ing for international trade (5, 6), as well as to 
maintain the health and productivity of the 
habitats that support these species. The creep- 
ing influence of the Anthropocene means that 
rapid climate and atmospheric changes will 
be felt even where direct human pressures are 
low, with potential for far-reaching impacts on 
habitats and species (7). Detecting, quantifying, 
and understanding changes to the health and 
functioning of the remaining safe havens for 
threatened megafauna should be a Key conser- 
vation priority during this time of rapid change. 
In this study, we used a rare long-term dataset of 
plant reproduction and a photographic database 
of forest elephants to assess food availability 
and body condition of an emblematic species of 
megafauna within a Central African rainforest. 
Fruit is a keystone resource for many of the 
African megafauna (8-0). Fruit production is 
highly dependent on climatic cues, and there 
is abundant evidence that climate change has 
already caused shifts in the timing of plant 
reproduction in temperate regions (17-13). Data 
on reproduction of tropical plants are generally 
scarce or have not been recorded for long 
enough to track such changes (4). However, 
tropical plants will be affected by climate 
change, and changes in tropical tree reproduc- 
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tion have been detected at several sites for 
which long-term data are available (15-17). The 
mechanistic causes of these changes are not 
universal and are mostly unknown. The criti- 
cal minimum temperature hypothesis describes 
how, in some Afrotropical plant species, flow- 
ering is not triggered until temperatures drop 
below a certain threshold (J8). For these spe- 
cies, we would expect reproduction to be cued 
less often as temperatures rise. 

Lopé National Park, Gabon, is a nationally 
designated protected area and a UNESCO 
(United Nations Educational, Scientific and 
Cultural Organization) World Heritage site 
and is a relative safe haven for Central African 
megafauna, such as great apes and forest ele- 
phants (3, 4, 19). It also hosts the longest con- 
tinuous study of tropical tree phenology in 
Africa (20). From 1986 to the present, re- 
searchers at the site have monitored tree spe- 
cies that are important to the diet of gorillas, 
chimpanzees, and elephants. Once a month, 
focal tree crowns are observed from the ground 
(via binoculars), and the proportions of each 
canopy that are covered with flowers, fruit, 
and leaves are recorded (21). We used these 
data to quantify changes in the probability of 
encountering flowers, unripe fruit, and ripe 
fruit for 73 species over a 32-year period 
(1986-2018; n = 260,431 monthly crown ob- 
servations, 7 = 2007 focal tree crowns; tables 
S1 and S2), while accounting for individual 
and species-level variation by using general- 
ized linear mixed models (GLMMs) with a bi- 
nomial error structure (22). We replicated this 
analysis for a subset of species (7 = 14 species) 
that bear fruit previously identified as espe- 
cially important in the diet of forest elephants 
during a 30-month dung study and an 8-year 
observational study of elephant diet at the site 
(9). We also calculated a ripe fruit availability 
score for each year as a proportion of maxi- 
mum theoretical fruit availability (equivalent 
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to all trees of all species bearing 100% canopy 
cover of ripe fruit for 12 months of the year) by 
selecting species that had been monitored 
continuously throughout the study period (n = 
40 species) and, separately, a subset of these 
species that are especially important compo- 
nents of the elephant diet (nm = 7 species). 

We found that trees at Lopé are reproducing 
less often and that the probability of encount- 
ering flowers and fruit has declined signifi- 
cantly over time (GLMM;; tables S3 and S4: and 
fig. S1). The average rate of encountering ripe 
fruit for all species (2 = 73) dropped from 1 in 
every 10 trees in any given month in 1987 to 
fewer than 1 in 50 by 2018, a substantial 80.9% 
decline (Fig. 1A and table S4). Although there 
was some variation between species, all spe- 
cies declined except one (Dacryodes buettneri; 
table S5 and fig. S2). Encounters of ripe fruit 
important for elephants (n = 14) declined at 
an even faster rate (-87.8%), from one in five 
trees in 1987 to fewer than 1 in 40 in 2018 
(Fig. 1A). A general reduction in fruit availa- 
bility for species monitored throughout the 
time period demonstrates that larger fruiting 
events have not compensated for the dimi- 
nished frequency of fruit encounters (Fig. 1B). 
Availability of fruit important for elephants 
dropped steeply between 2000 and 2003, and 
the yields of the best years for fruit production 
after 2004 were lower than those of the poor- 
production years before 2000 (Fig. 1B). Al- 
though the long dry season (June to September) 
has always been a time of fruit scarcity (0), 
historical seasonality in fruit availability has 
disappeared in recent years, owing to a major 
reduction in ripe fruit observed from October 
to March, which was previously the season of 
abundance (Fig. 1C). 

Our results showing declines in the proba- 
bility of encountering flowers and unripe fruit 
(fig. S1) indicate that suppressed production of 
ripe fruit is not primarily a pollination or fruit 
maturation issue. Nor is it likely to be due to 
any negative consequences of tree senescence, 
because we removed trees that died or were 
diseased prior to analysis and found no evi- 
dence that tree size (as a proxy for tree age) 
influenced reproduction over time (fig. S3). 
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Instead we consider it likely that climate 
changes experienced at the site have contrib- 
uted to this shift in reproduction. Global 
warming has caused minimum daily temper- 
ature to increase, on average, by 0.25°C per 
decade at Lopé (23), which may be a key factor 
in reduced reproduction for some tree species 
that rely on a critical minimum temperature 
to trigger flowering (/8). Rainfall has simulta- 
neously decreased at the site by 75 mm per 
decade (23); thus, all tree species might be 
suffering because of water stress (24). 

Given the decline in fruit since 1986, it is 
likely that fruit-dependent wildlife—such as 
forest elephants, great apes, monkeys, and 
many bird species—has been affected. Long- 
term population data are not available for these 
species at Lopé. Other measures of popula- 
tion health, such as body condition, can be 
used to gauge population responses to envi- 
ronmental change over relatively short periods. 
Forest elephants, the largest frugivores in the 
ecosystem (9), have been consistently photo- 
graphed by researchers and visitors to the 
site since the late 1990s, resulting in a large 
photographic database (>80,000 photos). We 
used this database to evaluate annual and 
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Fig. 1. Changes in ripe fruit encounters and availability from 1986 to 
2018 at Lopé National Park, Gabon. (A) Results from analysis with a 
binomial GLMM (table $4), showing the change in probability of 
encountering ripe fruit over time for any given tree in any given month 
(solid lines) and 95% Cls (dashed lines). Summed raw data are 

plotted in fig. S1. (B) Interannual changes in ripe fruit availability for 

all species monitored consistently throughout the time period. Fruit 
availability is calculated as a proportion of maximum theoretical fruit 
availability; ripe fruit availability equals 1 when all trees in a subset have 
100% canopy cover of ripe fruit for 12 months in a year. (C) Results 
from analysis with a binomial GLMM (table $4), showing the seasonal 
change in probability of encountering ripe fruit for any given tree in each 





seasonal trends in the external body condi- 
tion of forest elephants between 1997 and 
2018, hypothesizing that elephant body con- 
dition has declined along with reduced food 
availability. Elephant body condition in photos 
was scored systematically by means of a 
custom-built web application and user inter- 
face (22). Scorers (n = 6 individuals) did not 
have access to the time and date of photo- 
graphs, nor to the research question and hy- 
pothesis. Scoring effort (number of photos 
viewed per scorer) varied, but we found high 
agreement among scorers who used a stand- 
ardized test database (mean intraclass corre- 
lation coefficient of 0.89, 2 = 200 photos). A 
total of 2823 photos met the strict image- 
quality criteria for scoring, and we used lin- 
ear mixed effects models (LMMs) to quantify 
changes in elephant body condition (account- 
ing for elephant age) over the full 21-year 
period, as well as separately for the first 11 years 
(1997 to 2007) and last 11 years (2008 to 2018) 
of data. 

We detected long-term declines in forest 
elephant body condition at Lopé (LMM). For 
the period between 1997 and 2018, mean body 
condition of the population declined by 5.0% 
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for all age classes. However, uncertainty was 
high, ranging from a small improvement in 
body condition to a large decline [LMM; 95% 
confidence interval (CI) = +0.9 to —11.2%; table 
S6]. The change was most pronounced in the 
second half of the time period (2008-2018) 
when body condition declined, on average, 
by 11.1% (LMM; 95% CI = -4.3 to -15.6%; 
Fig. 2A and table S6). Body condition varied 
seasonally in the first half of the time period 
(1997-2007), appearing to track fruit availa- 
bility (dipping in June and peaking after the 
long dry season) without any obvious lag at the 
monthly resolution of this analysis (Fig. 1B). 
The sharp drop in body condition in November 
during 1997-2007 was inconsistent with fruit 
availability, but closer inspection of the data 
indicated that this finding was highly in- 
fluenced by one very thin individual (body 
condition score = 2), recorded in November 
1999, out of only three photos available for this 
calendar month during this period (table S). 
In the 2008-2018 period, the peak in body 
condition after the long dry season had dis- 
appeared (Fig. 2B and table S6). However, 
the sparse data for 1997-2007 and high un- 
certainty in the CIs for seasonality in body 
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Fig. 2. Long-term and seasonal changes in elephant body condition from 1997 to 2018 at Lopé National 
Park, Gabon. (A) Change in elephant body condition from 2008 to 2018. Mean change (solid line) and 95% Cls 
(dashed lines) are from analysis with a LMM (table S6). Example images of different elephant body condition 
scores are shown at left; see fig. S4 for enlarged images. (B) Mean monthly elephant body condition 
and 95% Cls from analysis with LMMs (table S6) for the periods from 1997 to 2007 (no data for 


January and December) and from 2008 to 2018. 


condition mean that comparisons between 
the early and late time periods should be 
made with caution (tables S6 and S7). 

It is not known whether the changes ob- 
served in body condition in this study have 
affected forest elephant population health or 
dynamics in the study area. However, studies 
of African savanna elephants show that envi- 
ronmental stressors can have substantial long- 
term consequences for both individual fitness 
and population dynamics, with reproductive 
females and calves particularly affected (25). 
Reduced food availability could also act in sy- 
nergy with other factors (such as disease) to 
magnify negative physiological consequences 
(table S8). Although the biological mechanisms 
and consequences of declining body condition 
are unclear at this point, the effects on forest 
elephant populations across the region are un- 
likely to be benign, particularly when coupled 
with illegal hunting, habitat loss, and habitat 
degradation (3). 

These declines in both plant reproduction 
and elephant body condition are indicative of 
system-wide change and are expected to have 
disproportionate impacts on the functioning 
and metabolism of the ecosystem. A reduc- 
tion or displacement of historic populations 
of large frugivores in this region, along with 
diminished availability of seeds, could lead to 
collapse of seed dispersal (8), landscape-level 
shifts in habitat structure (26), reduction of 
carbon stocks (27), and potential for increased 
competition with humans for food (28). How- 
ever, long-lived plant and animal species are 
able to buffer environmental change to a cer- 
tain extent (J), and the broad diet of forest 
elephants (9) means that alternative food 
sources could offer some relief. Nonethe- 
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less, environmental stress can have lifelong 
negative impacts on elephant survival and 
fitness (25), and the impacts of these changes 
should be closely monitored. 

The long-term plant and animal datasets 
presented here are rare for the tropics, and it 
is possible that such changes may be occur- 
ring elsewhere, undocumented. These data 
serve as a reminder that even where direct 
human pressures are low, plant and animal 
communities may not be protected from the 
creeping influences of the Anthropocene. Co- 
ordinated international efforts to relieve direct 
human pressures and to halt and reverse cli- 
mate change will be critical to saving the re- 
maining megafauna and megaflora of the 
African tropics and preserving their special- 
ized roles in the functioning of our biosphere. 
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QUANTUM GASES 


Universal sound diffusion in a strongly interacting 


Fermi gas 


Parth B. Patel’**, Zhenjie Yan’, Biswaroop Mukherjee’**, Richard J. Fletcher’, 


Julian Struck'?*4, Martin W. Zwierlein’*3* 


Transport of strongly interacting fermions is crucial for the properties of modern materials, nuclear 
fission, the merging of neutron stars, and the expansion of the early Universe. Here, we observe 

a universal quantum limit of diffusivity in a homogeneous, strongly interacting atomic Fermi gas by 
studying sound propagation and its attenuation through the coupled transport of momentum and 
heat. In the normal state, the sound diffusivity D monotonically decreases upon lowering the 
temperature, in contrast to the diverging behavior of weakly interacting Fermi liquids. Below the 
superfluid transition temperature, D attains a universal value set by the ratio of Planck’s constant 
and the particle mass. Our findings inform theories of fermion transport, with relevance for 
hydrodynamic flow of electrons, neutrons, and quarks. 


ransport in fermionic quantum matter 

lies at the heart of phenomena as varied 

as superconductivity in cuprates (7) and 

bilayer graphene (2), inspirals of neutron 

star binaries (3), and perfect fluidity of 
the early universe (4). For hydrodynamic flow, 
transport is governed by diffusion, which 
facilitates the decay of spatial variations in 
globally conserved quantities—such as mo- 
mentum, energy, charge, or spin—at a rate 
set by the corresponding diffusivity. A ubiqui- 
tous example is the attenuation of sound in 
fluids, where the modulation in current den- 
sity and temperature causes diffusion of mo- 
mentum and heat, leading to attenuation of 
sound at a rate set by the sound diffusivity 
D. The magnitude and temperature depend- 
ence of sound diffusivity reveal many of 
the characteristic features of the underlying 
substance. 

Kinetic theory yields an estimate of D ~ vl, 
where v is the average velocity of the particles 
and J is their mean free path, which can vary over 
many orders of magnitude across substances. 
However, for strongly interacting quantum 
liquids and gases, a certain universality of dif- 
fusion coefficients may be expected. Here, the 
mean free path becomes on the order of the 
interparticle spacing, and the velocity takes on 
the Heisenberg-limited value v ~ i/ml, where 
his the reduced Planck’s constant and ™m is the 
particle mass. This leads to a limiting value of 
D ~ h/m, with a numerical prefactor of order 
unity determined by the specific microscopic 
interactions. Such Heisenberg-limited values 
were observed for the spin diffusivity in a uni- 
tary Fermi gas (5-7), as well as the momentum 
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diffusivity (the shear viscosity) in both the 
quark-gluon plasma of the early Universe and 
the unitary Fermi gas (4). Notably, the quan- 
tum liquids of bosonic “He and fermionic *He 
display similar sound diffusivities of D ~ h/m 
around 4 K (8, 9). However, upon lowering 
the temperature into the deeply degenerate 
regime, these two quantum liquids display 
markedly different behaviors in the damping 
of sound. Down to about 1 K, the sound at- 
tenuation in “He does not vary strongly with 
temperature, decreasing only by a factor of 
two across the superfluid transition, with a 
minimum of D ~ 0.5h/m (8, 10). By contrast, 
"He features a diverging diffusivity (<< 1/T?), 
characteristic of a Fermi liquid, growing to 
~50,000i/m around 2 mK, followed by a steep 
drop at the superfluid transition and settling 
to a value of ~5000h/m (9). A priori, it is un- 
clear whether the temperature dependence 
of sound attenuation in a strongly interacting, 
fermionic gas—of atoms, electrons, or neutrons— 
should resemble that of a quantum liquid, and 
if so, whether it corresponds more closely to 
the strongly interacting, but bosonic, liquid 
“He or to the fermionic, but weakly interact- 
ing, liquid °He. 

Ultracold atomic Fermi gases at unitarity 
are a prototypical strongly interacting quan- 
tum fluid for transport experiments (11-15). 
Featuring a mean free path as short as one 
interparticle spacing, these systems display the 
most robust form of fermionic superfluidity 
and near-perfect hydrodynamic flow even in the 
normal state (16-18). The presence of scale in- 
variance leads to universality in physics proper- 
ties 3-15, 19-21), including transport (5, 22-24). 
The universality directly connects this system 
to a host of other strongly interacting Fermi 
systems across all energy and length scales 
from nuclear matter to neutron stars. For the 
unitary Fermi gas, scale invariance implies 
that sound diffusivity must remain the same 
upon changing all length scales by the same 
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factor. The diffusivity is thus A/m times a uni- 
versal function of T/T, where the tempera- 
ture T is normalized by the Fermi temperature 
Tr that only depends on the particle density n 
(25). At nondegenerate temperatures T > Tp, 
we expect a unitary Boltzmann gas, where 
the thermal wavelength A = ,/2nh?/(mkgT) 
(kg, Boltzmann constant) sets both the mean 
free path and the typical velocity of excita- 
tions, J ~ 1/(nA*) and v ~ h/(md), implying 
D ~ (h/m)(T/ Tp)?! *. In the quantum critical 
regime of the unitary gas (26, 27), at T ~ Tp, 
the interaction and thermal energies are com- 
parable and even the nature of the equilibrium 
state is a subject of debate (28, 29). At low tem- 
peratures T«<T7y, it remains unknown whether 
the sound diffusivity diverges as 1/T? (30), as 
in the Fermi liquid “He (31, 32), and whether 
any sudden drop in the sound diffusion occurs 
upon entering the superfluid regime. Calcula- 
tion of such transport parameters is very dif- 
ficult; for example, predictions for the shear 
viscosity vary from zero (33), as suggested by 
experiments on expanding inhomogeneous 
gases (22, 24), to infinity if phonon damping 
dominates (23, 34-36). 

Transport experiments on Fermi gases have 
thus far used harmonic traps (/5) or terminal 
configurations (37, 38) and have been used 
to probe collective oscillations (39-41), spin 
transport (5, 7, 42), viscosity (22), conductivity 
(37), and Josephson oscillations (38). However, 
obtaining transport coefficients of homoge- 
neous matter from inhomogeneous samples 
in atom traps requires sophisticated analysis 
and assumptions on the spatial flow profile 
(5, 22, 43). With the recent advent of optical 
box traps (44-47), it is now possible to di- 
rectly probe the transport properties of homo- 
geneous quantum gases (47-50). The gas is then 
in the same state throughout, and transport 
properties are identical across the system. 

Measurements of transport properties in- 
volve the response of a system to an external 
drive. In linear response, an applied potential 
change ôV couples to perturbations in the fluid 
density 62 = —x6V by the density response 
function x. Sound corresponds to a resonant 
response, that is, a pole in y at a frequency 
œ = ck, set by the speed of sound c and wave 
number &, in the vicinity of which y(œ%, k) ~ 
1/(@° — ek? + iro) (51, 52). Here, 7 is the 
unit imaginary number and I is the damping 
rate of sound, given by [ = Dk? (53) for hydro- 
dynamic systems. Measurements of y and T 
thus directly provide the sound diffusivity. Ex- 
periments involving liquid helium have used a 
number of techniques to measure y, from free 
decay of resonant modes in a cylindrical reso- 
nator (8, 9) to Brillouin scattering off of sound 
waves (54). 

In our homogeneous quantum gas, the con- 
stant background density enables an ideal 
realization of a density response measurement 
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(Fig. 1A). We use an equal two-state mixture 
of Li atoms with resonant interstate inter- 
actions, confined to a cylindrical optical box 
potential composed of three repulsive laser 
beams: a hollow cylindrical beam providing 
the radial confinement (radius 60 um) and 
two sheets of light serving as endcaps (length 
L ~ 100 um) (45). The number N~10° of atoms 
per spin state yields a Fermi energy of Ep = 
h° ky? /(2m) ~ h x 10kHz. To inject sound 
waves, we sinusoidally modulate the inten- 
sity of one endcap beam, which drives the 
gas at a well-defined frequency o, and a wide 
range of spatial wave numbers, Fourier lim- 
ited by the width ~4 um of the endcap poten- 
tial’s edge (55). At the given driving frequency, 
the resonant sound response of the gas is 
dominated by a specific wave number k = w/c, 
resulting in a traveling wave of sound. An in 
situ absorption image is taken after an evolu- 
tion time sufficiently short such that no re- 
flections occur, and the resonant wave number 
k is directly measured (Fig. 1B, ii to iv). By re- 
peating this protocol for different drive fre- 
quencies, we obtain the dispersion relation 
w(k) for wave numbers k < 0.l4kr (Fig. 10). 
It is linear within our measurement error, 
corresponding to a constant speed of sound 
c = œ/k as a function of wave number. We note 
that at wavelengths approaching the inter- 
particle spacing, and thus at momenta hk ap- 
proaching the Fermi momentum (k ~ kp), 
deviations from linear sound dispersion are 
expected for the unitary Fermi gas (56). 

The precise measurement of the speed of 
sound allows a sensitive test of scale invar- 
iance of the unitary Fermi gas. In general, 
the speed of isentropic sound propagation 
cis directly tied to the equation of state by 
the hydrodynamic relation mc” = (OP/On), = 
(V?/N )(0°E/OV?)|,. Here, E is the energy, S is 
the entropy, V is the volume, and P = —(0E/OV )| , 
is the pressure of the gas. A notable property 
of all nonrelativistic scale invariant systems 
in three dimensions is that their total ener- 
gy scales as EV ~?/: this follows from the 
scaling behavior E > E/A” under dilation of 
space by a factor A. This directly yields mc? = 
(10/9)E/N, independent of temperature or 
the phase of matter. In Fig. 1D, we show the 
measured speed of sound as a function of 
the energy per particle E/N, obtained from 
an isoenergetic expansion of the gas from 
the box into a harmonic trap (57). For both 
superfluid and normal samples (blue and red, 
respectively), the scale invariant prediction 
(solid black line) captures the data well with 
no free parameters. This demonstrates the 
universality of the speed of sound and scale 
invariance in the unitary Fermi gas in the 
explored window of temperature. 

The attenuation of sound is already appar- 
ent in the spatial decay of the traveling waves 
shown in Fig. 1. For a precision measurement 
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of the sound diffusivity, we now turn to the 
steady-state response of the system to a con- 
tinuous drive, which directly reveals the den- 
sity response function y. The intensity of one 
of the endcaps is modulated for a sufficiently 
long time such that the density evolution has 
reached a steady state. After an integer num- 
ber of driving cycles, the spatial Fourier trans- 
form of the density yields the out-of-phase 
response of the system, or Im|y(, %)] (55). 
This quantity also gives the average power 
absorbed by the system for a drive at fre- 
quency œ and spatial frequency k, and thus 
directly reveals the poles of y as resonances. 
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The measurements are summarized in Fig. 2. 
Each row of pixels in Fig. 2B shows the frac- 
tional density modulation at a particular 
drive frequency after integration along the 
radial axis. This “sonogram” reveals discrete 
normal modes, the first five of which are 
shown in Fig. 2A. The spatial Fourier trans- 
form, giving the out-of-phase response func- 
tion, is shown in Fig. 2C. For each normal 
mode in the box, it features a peak at œ = ck. 
The sound attenuation rate can be seen to 
increase with k, revealed in both a broadened 
frequency response as well as a reduced peak 
height. 
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Fig. 1. Sound waves in a homogeneous unitary Fermi gas. (A) Fermionic °Li atoms are trapped in a 
three-dimensional cylindrical box made from green laser beams. Sound is excited by modulating the intensity 
of one of the laser walls. (B) The resulting density wave is observed via an in situ absorption image, shown 
as optical density (OD) for both an unperturbed (i) and modulated (ii) sample. Here, the modulation 
frequency is 2x x 600 Hz. Taking their difference (iii) and integrating along the homogeneous radial trap 
axis reveals (iv) a perturbation in the fractional density difference An/n, propagating along the axial 
direction z and exhibiting a well-defined wave number k corresponding to the applied modulation 
frequency œ. (C) Dispersion of sound w(k). The fitted slope (black line) provides the speed of sound. 
The insets display sound waves observed at w = 2x x 600 Hz and w = 2x x 850 Hz. Errors in the 
measured k are smaller than the point size. (D) Measurement of the universal relation between the 
measured speed of sound and the energy-per-particle E/N (see text). The black solid line shows 

the predicted linear dependence for any nonrelativistic scale invariant system in three dimensions; 
mc? = E/N. Data are shown for both the normal (red) and the superfluid (blue) phase. 
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The density response Im|x(o,%;)| at the 
wave number k; =jn/L of the j normal 
mode of the box is shown in Fig. 3A, along 
with Lorentzian fits (solid lines). The full- 
width at half-maximum yields the damping 
rate of sound I, which is shown as a function 
of k in Fig. 3B, for gases both above (red and 
green) and below (blue) the superfluid tran- 
sition. At temperatures above the superfluid 
transition temperature Te = 0.17Tr (20), we 
observe T (ķ) to increase quadratically with k 
for all explored wave numbers (k < 0.3 mc/h). 
This establishes diffusive damping of sound in 
the normal regime, as expected in the colli- 
sionally hydrodynamic regime (16, 58). 

Below the superfluid transition temperature, 
T < Tc, we observe a crossover from quadratic 
scaling of T (%) at wave numbers k < 0.2mc/h 
to linear behavior, indicating a departure from 
purely hydrodynamic transport at high wave 
numbers. This is expected when the modula- 
tion frequency becomes comparable to the 
damping rate of thermal phonons I pp (36, 59). 
Collisionless or Landau damping of sound is 
caused by nonlinearities resulting from the 
kinetic energy density carried by sound and 
the density dependence of the speed of sound. 
Fermi’s golden rule yields a rate [nk 
(36, 51) proportional to the energy Ack carried 
by a phonon. Including a nonzero damping rate 
of phonons Ih yields a crossover from hydro- 
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dynamic to collisionless damping as the sound 
frequency ck exceeds Ipn (59). The relation 
T = Dk’f (ck /T mh) with f (æ) = tan7"(a)/x (59) 
shows a good agreement with the data (solid 
line). The scale of the fitted relaxation rate 
[pn = 0.27(8)kgT/h is on the order of the gas 
temperature, hinting toward quantum critical 
damping (27). We note that the observation 
of quadratic scaling of r with k at low wave 
numbers implies that sound is primarily at- 
tenuated in the bulk, and that edge effects 
are negligible (53, 60). 

As the main result of this work, we present 
in Fig. 4 the sound diffusivity D of the unitary 
Fermi gas. This is obtained as D = T/K? using 
the j = 2 normal mode, for which ck/Iph is 
always less than 0.25, ensuring that it is well 
described by a hydrodynamic framework. The 
measured values are shown in units of h/m 
and, at low temperatures, exhibit a numerical 
prefactor ~1. This demonstrates universal sound 
diffusion, with no dependence on microscopic 
parameters of the gas. 

Generally, the sound diffusivity contains 
contributions from both the bulk and shear 
viscosity, ¢ and n, respectively (which damp 
momentum gradients), and the thermal con- 
ductivity « (which damps temperature gra- 
dients) (53). However, for a scale-invariant 
fluid, the bulk viscosity vanishes (67) and 
D = Dy + Dx only, with D, = 4n/(3mn) and 


An/n (%) 
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D, = 4«T/(15P) (55). We note that our mea- 
surements of D therefore constrain the rela- 
tionship between the viscosity and thermal 
conductivity, which is usually quantified 
by the Prandtl number Pr = cpn/x (53), 
where cp is the specific heat at constant 
pressure (55). 

The solid black line in Fig. 4 shows a 
prediction D = 6.46 (h/m)(T/Tg)*/”, which 
uses the high-temperature results for vis- 
cosity (23, 62) and thermal conductivity 
(23, 25), along with the ideal gas equation 
of state. This simple model captures the 
high-temperature behavior well without 
any free parameters. However, it is expected to 
underestimate D when T/Tr < 1 because it 
neglects the suppression of scattering arising 
from Pauli blocking. 

As the temperature is reduced, D smooth- 
ly drops to a value ~i/m, consistent with 
Heisenberg-limited diffusivity. Notably, at 
intermediate temperatures, we observe neither 
the D ~ 1/T? scaling typical of a Fermi liquid 
nor any sudden change at the superfluid 
transition. This is further demonstrated by the 
inset of Fig. 4, where we show a magnified plot 
of D (blue points) in the vicinity of the super- 
fluid transition (vertical red line) (20). Also 
shown is the pair condensate fraction (red 
points) obtained from the measured pair center- 
of-mass momentum distribution (63), which 
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Fig. 2. Normal modes of the cylindrical box trap. (A and B) The steady-state density response of the gas is obtained by modulating the container 

walls at frequency w for 30 cycles of the drive. Standing waves of sound corresponding to the normal modes in the box are observed at frequencies 

œj = jnc/L = 2nj x 7/Hz (where j € Z), the first five of which are shown in (A). The full sonogram is shown in (B). Here, each row of pixels corresponds to 
a particular realization of the experiment at a given frequency. (C) The spatial Fourier transform directly yields the density response function Im[x(k, w)]. 
It reveals well-defined resonance peaks exhibiting both the linear dispersion of sound and increasing widths in frequency at higher wave numbers, 
corresponding to increased rates of sound attenuation. 
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Fig. 3. Spectral response of sound and its 
attenuation rate. (A) The imaginary part of the 
density response function at each normal mode 
wave number k; displays a well-defined peak 

in frequency, whose full-width at half-maximum 
yields the mode damping rate Tr. This is obtained 
from a Lorentzian fit, shown by solid lines. 

(B) Damping rate T(k) for gas temperatures 
T/Te =0.36(5) (red circles), T/T = 0.21(3) 
(green squares), and T/Tr = 0.13(2) (blue 
triangles). For all temperatures, I'(k) displays a 
quadratic scaling at low momenta characteristic 
of diffusive damping. For our coldest samples, 
as k increases, we observe a deviation from this 
behavior, revealed by a crossover to linear scaling. 
At all temperatures and wave numbers, our data 
are well captured by the model of (59) 

(solid lines), which accounts for the finite 
relaxation rate of the fluid. Error bars represent 
lo statistical uncertainty. 


serves as both an indicator of superfluidity 
and a robust thermometer in the superfluid 
phase (15). Despite the definitive onset of pair 
condensation, we observe no measurable sharp 
feature in the diffusivity, which remains ap- 
proximately constant as the temperature is re- 
duced. Our nonvanishing diffusivity therefore 
contrasts previous measurements of a vanish- 
ing viscosity in the low-temperature limit (24). 
This behavior can qualitatively be understood 
as follows. In the superfluid phase, viscosity 
arises entirely from the normal component, 
giving a diffusivity D ~ (n,/n)lv where I = 
1/ (nno) is the mean free path of a typical ex- 
citation with scattering cross section o, ve- 
locity v, and density n, (30, 64). The dependence 
on the (strongly temperature-dependent) 7,, 
therefore cancels, giving D ~ v/(on). At the 
temperatures studied here, the normal com- 
ponent is dominated by broken pairs (15), whose 
velocity and cross section are only weakly 
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Fig. 4. Temperature dependence of the sound 
diffusivity. For temperatures comparable to 

the Fermi temperature, the sound diffusivity 

(D, normalized by A/m; blue circles) approaches 
the expected high-temperature scaling of T?/¢ 
(solid black line). As the temperature is lowered, 
D decreases monotonically and attains a quantum- 
limited value close to A/m. Below the superfluid 
transition [vertical red line, from (20)], D is 
observed to be almost independent of temper- 
ature and condensate fraction (nc/n, red circles 
in inset). From the transition temperature 

(Nc /n = 0) to the coldest temperatures 

(nc/n ~ 0.8), the changes in D are within the 
standard error of the measurements. Theoretical 
predictions for D are as follows: The dashed 
orange line is from the sound attenuation length 
calculated in the framework of kinetic theory 
(25), and the dashed green line is from a 
calculation of shear viscosity (23), assuming a 
Prandtl number of 2/3. Bars denote statistical 
error arising from the uncertainty in T. Addition- 
ally, the dominant systematic uncertainty in D is 
an error of 13% arising from the nonzero width 
of the endcaps. The red shaded regions represent 
the uncertainty in the superfluid transition 
temperature (20). 


temperature-dependent. Broken pairs are pri- 
marily formed at the Fermi surface, which is 
broadened by the pairing gap A. This results 
in a typical velocity v ~ Aky /m and cross section 
o ~ kp? (A/Erp)*, where the (A/Ep)” accounts 
for the restriction of phase space available for 
scattering arising from Pauli blocking. In the 
unitary Fermi gas, A ~ 0.44% (41, 65), giving a 
diffusivityD ~ h/m, consistent with our obser- 
vations. By contrast, the pairing gap in °He is 
A ~ 10-Eg, leading to a much larger value of 
D ~ 5000h/m (60, 66). 

We have measured the sound diffusivity 
of the unitary Fermi gas. The diffusivity ap- 
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proaches a Heisenberg-limited value of h/m 
at low temperatures, similar to the strongly 
interacting, bosonic quantum fluid “He. In con- 
trast to Fermi liquid behavior seen in weakly 
interacting fermionic systems, the diffusivity 
monotonically increases with increasing tem- 
peratures and eventually follows the high- 
temperature behavior D ~ h/m(T /Tz)*/*. The 
measured sound diffusivity constrains the 
shear viscosity and thermal conductivity of 
the unitary Fermi gas. In particular, combined 
with the calculated shear viscosity in (23), 
we find a Prandtl number strictly lower than 
unity for all explored temperatures (55). This 
excludes the existence of a relativistic con- 
formal gravity dual of the unitary Fermi gas 
(67), because this would require Pr = 1. Thanks 
to the scale invariance of the unitary Fermi 
gas, the results obtained here apply broadly to 
other strongly interacting forms of fermionic 
matter, from hydrodynamic electron flow to 
nuclei and neutron matter. 
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Robust neutralizing antibodies to SARS-CoV-2 
infection persist for months 


Ania Wajnberg"*, Fatima Amanat**, Adolfo Firpo*, Deena R. Altman?, Mark J. Bailey’, 
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Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused a global pandemic with 
millions infected and more than 1 million fatalities. Questions regarding the robustness, functionality, and 
longevity of the antibody response to the virus remain unanswered. Here, on the basis of a dataset of 
30,082 individuals screened at Mount Sinai Health System in New York City, we report that the vast 
majority of infected individuals with mild-to-moderate COVID-19 experience robust immunoglobulin G 
antibody responses against the viral spike protein. We also show that titers are relatively stable for at least 
a period of about 5 months and that anti-spike binding titers significantly correlate with neutralization 
of authentic SARS-CoV-2. Our data suggest that more than 90% of seroconverters make detectable 
neutralizing antibody responses. These titers remain relatively stable for several months after infection. 


evere acute respiratory syndrome coro- 

navirus 2 (SARS-CoV-2) has infected 

millions of individuals globally and, as 

of October 2020, has led to the death of 

>1 million individuals. Although the anti- 
body responses in severe COVID-19 cases have 
been relatively well characterized (1, 2), assess- 
ing the response in mild and asymptomatic 
cases is of great importance because they con- 
stitute most infections. It will be critical to 
understand the robustness of the antibody 
response in these mild cases, including its 
longevity and functionality, so as to inform 
serosurveys and to determine levels and du- 
ration of antibody titers that may be protec- 
tive against reinfection (3). 

Antibodies to SARS-CoV-2 can target many 
of its encoded proteins, including structural 
and nonstructural antigens. Thus far, two struc- 
tural proteins have been used as target antigens 
for serological assays. One is the abundant 
nucleoprotein (NP), which is found inside the 
virus or inside infected cells. However, because 
of the biological function of NP and because it 
is Shielded from antibodies by viral or cellular 
membranes, it is unlikely that NP antibodies 
can directly neutralize SARS-CoV-2. The sec- 
ond structural protein often used as a target 
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for characterizing the immune response to 
SARS-CoV-2 is the spike protein. The spike is a 
large trimeric glycoprotein that contains the 
receptor binding domain, which the virus uses 
to dock to its cellular receptor, angiotensin- 
converting enzyme 2, and for fusion of viral 
and cellular membranes (4, 5). It is known 
from other coronaviruses—and it holds true 
for SARS-CoV-2—that the spike is the main, 
and potentially the only, target for neutral- 
izing antibodies (6). Therefore, the assay 
used in this study to characterize the anti- 
body response to SARS-CoV-2 is based on 
the trimerized, stabilized ectodomain of the 
spike protein (7). An enzyme-linked immuno- 
sorbent assay (ELISA) initially developed 
in early 2020 has been extensively used in 
research (7-10). This so-called Mount Sinai 
ELISA has high sensitivity (92.5%) and speci- 
ficity (100%), as determined with an initial 
validation panel of samples (table S1). Fur- 
thermore, it has a positive predictive value 
(PPV) of 100%, with a negative predictive val- 
ue (NPV) of 99.6%. 

In March 2020, the Mount Sinai Health Sys- 
tem began screening individuals for antibodies 
to SARS-CoV-2 to recruit volunteers as donors 
for convalescent plasma therapy (77). Screened 
patients had either polymerase chain reaction 
(PCR)-confirmed SARS-CoV-2 infections or 
suspected disease, which is defined as being 
told by a physician that symptoms may be 
related to SARS-CoV-2 or exposure to some- 
one with confirmed SARS-CoV-2 infection. 
Most of the symptomatic patients who were 
screened experienced mild-to-moderate dis- 
ease, with <5% requiring emergency depart- 
ment evaluation or hospitalization. Mount 
Sinai also offered the antibody test to all 
employees in its health system on a volun- 
tary basis. By 6 October 2020, Mount Sinai 
had screened 72,401 individuals, with a total 
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of 30,082 individuals testing positive (defined 
as detectable antibodies to the spike protein 
at a titer of 1:80 or higher) and 42,319 testing 
negative. The clinical laboratory ELISA set up 
results in discrete titers of either 1:80, 1:160, 
1:320, 1:960, or =1:2880. Titers of 1:80 and 
1:160 were categorized as low titers, 1:320 as 
moderate, and 1:960 and =1:2880 as high 
titers. For plasma therapy, titers of 1:320 or 
higher were initially deemed eligible. Of the 
30,082 positive samples, 690 (2.29%) had a 
titer of 1:80; 1453 (4.83%), 1:160; 6765 (22.49%), 
1:320; 9564 (31.79%), 1:960; and 11,610 (38.60%), 
>1:2880 (Fig. 1). Thus, we conclude that the vast 
majority of positive individuals have moderate- 
to-high titers of anti-spike antibodies. Of course, 
the argument could be made that we could be 
missing a number of individuals who had been 
infected with SARS-CoV-2 and did not produce 
antibodies, given that many individuals in- 
cluded in our dataset had never been tested 
by a nucleic acid amplification test for the 
virus. An earlier analysis performed with a 
smaller subset of 568 PCR-confirmed indi- 
viduals using the same ELISA showed that 
>99% developed an anti-spike antibody re- 
sponse (8). In a later dataset of 2347 patients 
who self-reported positive PCR, 95% had 
positive antibody titers, which indicates that 
we did not miss large numbers of patients 
and confirms our prior sensitivity findings. 
Thus, the rate of individuals who do not 
seroconvert after SARS-CoV-2 infection is 
low, although such individuals may exist, 
and the majority of responders mount titers 
of 1:320 or higher. 

Determining the neutralizing effects of SARS- 
CoV-2 spike antibodies is critical to understand- 
ing possible protective effects of the immune 
response. Therefore, we performed a well- 
established quantitative microneutralization 
assay (12) based on authentic SARS-CoV-2 
with 120 samples of known ELISA titers rang- 
ing from negative to =1:2880. Neutralization 
titers significantly correlated (Spearman p = 
0.87, P < 0.0001) with spike-binding titers 
(Fig. 2A). Although there was some variability, 
sera with 1:320, 1:960, and =1:2880 ELISA 
titers had geometric mean 50% inhibitory 
dilutions (ID;9) of about 1:30, 1:75, and 1:550, 
respectively. If any and all neutralizing ac- 
tivity above background is considered, then 
~50% of sera in the 1:80 to 1:160 titer range, 
90% in the 1:320 range, and all sera in the 
1:960 to =1:2880 range had neutralizing ac- 
tivity (Fig. 2B). Only one of the negative samples 
showed activity slightly above background, 
which was potentially an ELISA false negative. 

Another important question is longevity of 
the antibody response to the spike. To assess 
the medium-range stability of serum antibody 
titers against the spike protein, we recalled 121 
plasma donors at a variety of titer levels who 
had initially been screened at around day 30 
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Fig. 1. SARS-CoV-2 spike antibody titers in 30,082 individuals. (A) The percentage of individuals with antibody titers of 1:80 (low), 1:160 (low), 1:320 (moderate), 
1:960 (high), and =1:2880 (high). (B) Absolute numbers of individuals testing positive and percent of individuals with titers of 1:320 over time. Testing of each 

sample was performed once in a Clinical Laboratory Improvement Amendments (CLIA)-certified laboratory using an assay that received emergency use authorization 
(EUA) from the U.S. Food and Drug Administration (FDA). 


Fig. 2. Neutralizing activity of serum 
samples in relation to ELISA titers. (A) A 
correlation analysis between ELISA titers on 
the x axis and neutralization titers in a micro- 
neutralization assay on the y axis. The 
Spearman p was determined. Red bars indicate 
the geometric mean. (B) The proportion of 
sera that exert any neutralizing activity in each 
of the ELISA titer categories. Testing was 
performed once, using an FDA EUA ELISA in a 
CLIA laboratory, or twice, following a stand- 
ardized neutralization protocol. 


A 


after symptom onset for two additional time 
points. The mean interval between the initial 
titer measurement and the second was 52 days 
(range: 33 to 67 days). This set the second time 
point at a mean of 82 days after symptom 
onset (range: 52 to 104 days) and the third 
time point at 148 days after symptom onset 
(range: 113 to 186 days). In comparing overall 
titers, we observed a slight drop from a geo- 
metric mean titer (GMT) of 764 to a GMT of 
690 from the first to the second time point 
and another drop to a GMT of 404 for the 
last time point (Fig. 3A). In the higher titer 
range of =1:2880 and 1:960, we also observed 
a slow decline in titer over time (Fig. 3, B and 
C). Unexpectedly, but in agreement with 
earlier observations that seroconversion in 
mild COVID-19 cases might take a longer time 
to mount (8), we saw an increase in individuals 
who had an initial titer of 1:320, 1:160, or 1:80 
(Fig. 3, D to F) from day 30 to day 82. Titers in 
these groups declined to about day 30 levels on 
day 148. Notably, one individual in the initial 
1:80 group dropped from a 1:80 titer to being 
negative at the day 82 time point, and two others 
lost reactivity at the day 148 time point, indi- 
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cating that very low initial titers might drop to 
undetectable levels over time. Neutralizing 
antibody titers followed titers measured by 
ELISA (Fig. 3G), and a good correlation be- 
tween neutralization and ELISA titers was 
still observed on day 148 (Fig. 3H). The initial 
serum antibody titer was likely produced by 
plasmablasts, and plasmablast-derived anti- 
bodies peak 2 to 3 weeks after symptom onset. 
Given an immunoglobulin G half-life of ~21 days, 
the sustained antibody titers observed here 
over time are likely produced by long-lived 
plasma cells in the bone marrow. Note that 
our observations contrast with a recent report 
that found waning titers at 8 weeks after virus 
infection (13). Especially in asymptomatic cases, 
antibody responses disappeared after 8 weeks 
in 40% of individuals in that study. However, 
the antibodies measured in that paper targeted 
the NP plus a single linear spike epitope. The 
same paper also reported relatively stable 
(slightly declining) neutralizing antibody 
titers, which shows much higher concordance 
with our present findings. Thus, the stability 
of the antibody response over time may also 
depend on the target antigen. 
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% of samples with neutralizing activity 
(cutoff IDs, of > 1:10) 


% with neutralizing activity 


neg. 1:80 1:160 1:320 1:960 >1:2880 


ELISA endpoint titer 


Correlates of protection have been estab- 
lished for many different viral infections. These 
correlates are usually based on a specific level 
of antibody acquired through vaccination or 
natural infection that substantially reduces 
the risk of (re)infection. One example is the 
hemagglutination inhibition titer for the in- 
fluenza virus, where a 1:40 titer reduces the 
risk of getting infected by 50% (/4). Similar 
titers have been established for the measles 
virus (an IDs, titer of 1:120), hepatitis A virus, 
hepatitis B virus, and many others (15). These 
titers have facilitated vaccine development con- 
siderably. For some viruses and vaccines, the 
kinetics of the antibody response is also known, 
allowing for an accurate prediction of how 
long protection will last (16). 

It is still unclear whether infection with 
SARS-CoV-2 in humans protects from rein- 
fection and, if it does, for how long. We know 
from work with common human coronaviruses 
that neutralizing antibodies are induced and 
that these antibodies can last for years and 
provide protection from reinfection or, in 
the event of reinfection, attenuate disease (17). 
Furthermore, we now know from nonhuman 
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Fig. 3. Antibody titer stability over time. (A) Titers of 121 volunteers whose 
blood was initially drawn ~30 days after COVID-19 symptom onset and who were 
then recalled for additional blood draws at ~82 days and 148 days after symptom 
onset. (B to F) The same data as in (A), but stratified by the initial (day 30) titer. 
Titers are graphed as geometric mean titers (GMT) with geometric standard 
error. (G) Neutralization titers of 36 individuals over time. A paired one-way 


primate models that infection with SARS- 
CoV-2 does protect from reinfection for at 
least some time (18, 19). We also know that 
transferring serum of convalescent animals 
or neutralizing monoclonal antibodies to naïve 
animals can be protective and reduces virus 
replication significantly (20, 21). Finally, vaccine- 
induced neutralizing antibody titers have been 
established as a correlate of protection in non- 
human primates (22). Notably, these vaccine- 
induced titers were relatively low and in the 
lower range of the titers observed in this study. 
Our data reveal that individuals who have re- 
covered from mild COVID-19 experience rela- 
tively robust antibody responses to the spike 
protein, which correlate significantly with neu- 
tralization of authentic SARS-CoV-2 virus. Fur- 
thermore, the vast majority of individuals with 
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antibody titers of 1:320 or higher show neu- 
tralizing activity in their serum. We also find 
stable antibody titers over a period of at least 
3 months and only modest declines at the 
5-month time point, which is consistent with 
data for the human coronaviruses SARS-CoV-1 
and Middle East respiratory syndrome-related 
coronavirus (MERS-CoV) (17). We plan to follow 
this cohort over longer intervals of time. Al- 
though we cannot provide conclusive evidence 
that these antibody responses protect from re- 
infection, we believe it is very likely that they 
will decrease the odds ratio of reinfection and 
may attenuate disease in the case of break- 
through infection. We believe that it is impera- 
tive to swiftly perform studies to investigate 
and establish a correlate of protection from 
SARS-CoV-2 infection. A correlate of protec- 
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analysis of variance corrected for multiple comparison was used to determine 
Statistical significance. (H) A correlation analysis between ELISA titers on the 
X axis and neutralization titers in a microneutralization assay on the y axis 

at day 148. Red bars indicate the geometric mean. The Spearman p was 
determined. Testing was performed once, using an FDA EUA ELISA in a CLIA- 
certified laboratory, or twice, following a standardized neutralization protocol. 


tion, combined with a better understanding of 
antibody kinetics to the spike protein, would 
inform policy regarding the COVID-19 pan- 
demic and would be beneficial to vaccine de- 
velopment efforts. 
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Temperature limits to deep subseafloor life in the 
Nankai Trough subduction zone 
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Microorganisms in marine subsurface sediments substantially contribute to global biomass. 
Sediments warmer than 40°C account for roughly half the marine sediment volume, but the 
processes mediated by microbial populations in these hard-to-access environments are poorly 
understood. We investigated microbial life in up to 1.2-kilometer-deep and up to 120°C hot 
sediments in the Nankai Trough subduction zone. Above 45°C, concentrations of vegetative 

cells drop two orders of magnitude and endospores become more than 6000 times more abundant 
than vegetative cells. Methane is biologically produced and oxidized until sediments reach 80° 

to 85°C. In 100° to 120°C sediments, isotopic evidence and increased cell concentrations 
demonstrate the activity of acetate-degrading hyperthermophiles. Above 45°C, populated zones 
alternate with zones up to 192 meters thick where microbes were undetectable. 


cientific ocean drilling has demonstrated 
the ubiquity of microbial life in deep sub- 
seafloor environments down to 2.5 km 
below seafloor (1-3). Because sediment 
temperature increases with burial depth, 
more than 50% of the global marine sediment 
volume is situated above 40°C (4). So far, the 
vast majority of subseafloor-life studies have 
targeted environments with in situ temper- 
atures <30°C; the habitability of hotter sedi- 
ments is largely unexplored. Microbes with 
growth temperatures up to 122°C have been 
isolated at hydrothermal vents (5), where the 
metabolism of these hyperthermophiles is 
fueled by high fluxes of oxidants and reductants 


(6). However, in deeply buried sediments, 
energy is limited, and with increasing depth 
and temperature, the slow-growing microbial 
communities struggle to meet the cellular 
maintenance energy requirement (3, 7, 8). 
Even in organic matter-rich petroleum res- 
ervoirs, microbial activity appears to cease at 
temperatures of ~80°C (9, 10). 

Aiming to fill the vast knowledge gaps re- 
garding the response of microbial life to in- 
creasing temperature, we investigated up to 
1.2-km-deep and up to 120°C hot sediments 
in the Nankai Trough off Cape Muroto, Japan 
(fig. S1). In this area, an up to 16-million-year- 
old, ~600-m-thick succession of hemipelagic 


mudstones and tuffs has been rapidly buried 
by an equally thick layer of trench deposits over 
the past ~0.4 million years (My) [(7)); fig. S2]. 
Sediments concurrently heated by about 50°C 
and the onset of subduction formed a décolle- 
ment separating the accreting and under- 
thrusted domains (JJ, 12). First indications for 
the presence of microbial life in ~800-m-deep, 
~80° to 90°C warm sediments at a nearby drill 
site date back two decades (11, 12). However, 
insufficient sensitivity in cell detection at that 
time compromised the habitability assessment 
of this environment (J/, 12). We designed Ex- 
pedition 370 of the International Ocean Dis- 
covery Program (IODP) to achieve maximal 
sensitivity in life detection together with ac- 
curate determination of in situ temperatures, 
and established Site C0023 (32°22.0018'N, 
134°57.9844'E, 4776-m water depth; fig. $1) in 
the vicinity of the previous drill site [(13); see 
supplementary materials]. Rigorous precau- 
tions during sampling and improvements in 
cell enumeration techniques increased the sen- 
sitivity in cell detection by five orders of magni- 
tude compared with the previous study (12). 
For the quantification of cells that can be stained 
by a fluorescent dye (hereafter termed vegeta- 
tive cells), the procedural blank was 4.2 + 
4.0 cells cm~° of sediment (N = 20), thereby 
yielding a minimum quantification limit (MQL) 
of 16 cells cm’. Temperature measurements in 
the borehole constrained a steady-state tem- 
perature profile with a gradient of 110°C km™ 
and a temperature of 120° + 3°C in the deepest 
core retrieved from the basement at 1177 m 
below seafloor (mbsf) (figs. S3 and S4). The 
combination of authigenic minerals and ther- 
mally altered biomarkers reveals a history of 
episodic, short-term ingression of ~140° to 220°C 
hot hydrothermal fluids along permeable strata 
in the underthrust domain [(/4); fig. S2]. 

At Site C0023, the depth profile of cell con- 
centrations deviates notably from the global 
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Fig. 1. Depth profiles of vegetative cells and endospores in relation to 

environmental factors at IODP Site C0023. (A) Concentrations of vegetative 
cells determined by counting of microbial cells fluorescently stained with SYBR 
Green |. (B) Concentrations of bacterial endospores derived from analysis of the 
diagnostic biomarker DPA; analytical sensitivity corresponds to a detection limit 
(DL) of 2.2 x 10* endospores cm™?. (C) A schematic summary of temperature, 


trend of gradually decreasing cell concentra- 
tions observed in similarly deep but substan- 
tially colder (<30°C) sediments (/, 2). At ~300 
to 400 mbsf, concentrations of vegetative cells 
drop abruptly by two orders of magnitude and 
approach the MQL as temperature rises from 
40° to 50°C (Fig. 1A). Concurrently, concen- 
trations of endospores—that is, dormant, re- 
sistant structures affiliated with the bacterial 
phylum Firmicutes (fig. $5), which are widely 
found in marine sediments and soils (15, 16)— 
increase to 2 x 10° cm™? (Fig. 1B). Nevertheless, 
a small microbial population persists at >50°C 
in the form of both vegetative cells and endo- 
spores (Fig. 1). Down to the 120°C hot basement, 
sediments harboring microbial communities 
with up to 400 vegetative cells cm™ are in- 
terspersed with intervals of up to 192-m thick- 
ness, in which no cells were detected (Fig. 1A 
and fig. S6). We rule out the possibility that the 
detection of cells resulted from contamination 
because cell concentration is neither related 
to the abundance of fractures in sediment 
cores nor related to the concentration of the 
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perfluorocarbon-based contamination tracer 
supplied during the drilling operation (fig. S7); 
such relationships would be expected if con- 
taminant cells were introduced by drilling 
fluids. Consistent with the extremely low 
concentrations of vegetative cells and the 
difficulty of extracting DNA from endospores 
(17), DNA yields were insufficient for producing 
reliable DNA-based community data for samples 
buried more deeply than 320 mbsf (73). In 
samples shallower than 320 mbsf, the commu- 
nity resembled those found in shallow sub- 
surface sediments (13). 

In contrast to the scattered distribution of 
vegetative cells in sediments >50°C, endospores 
show a clear zonation (Fig. 1B), as quantified by 
measurement of the diagnostic biomarker 
dipicolinic acid (DPA) (/8). We rule out that 
substantial levels of DPA could have accu- 
mulated after the decay of endospores, given 
the propensity of 2-carboxylated pyridines 
to decarboxylate upon moderate short-term 
heating (19). Endospore concentrations rise 
prominently in a ~200-m interval of 75° to 
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tectonic units, and salinity showing the geochemical influence of basalt alteration 
in the basement; red symbols on the temperature axis designate the depth 
horizons where in situ temperature measurements were made. Gray shading 
indicates zones where concentrations of both vegetative cells and endospores 
were undetectable in all samples; the gray dashed line indicates the location of 
the SMTZ (compare Fig. 2). 


90°C hot sediments, with a maximum of 1.2 x 
10° endospores cm~™® at 85°C. The average 
endospore-to-vegetative cell ratio exceeds 
6000 in sediments below 350 mbsf (table S1) 
and is thus two to three orders of magnitude 
higher than that in cold subseafloor sedi- 
ments (18). Plausible scenarios for the accu- 
mulation of endospores in sediments that are 
nearly barren of vegetative cells relate to the 
thermal history of the site since the onset of 
trench conditions ~0.4 My ago (12, 14) and 
involve the transitory growth of a thermo- 
philic population of endospore formers [com- 
pare (16)] after temperature rose to ~50°C 
and its subsequent sporulation (fig. S8). In- 
terestingly, in two expanded horizons, at 570 
to 633 mbsf and 829 to 1021 mbsf, neither 
vegetative cells nor endospores were detected 
(Fig. 1 and fig. S6). 

Pore-water profiles of microbial substrates 
and products provide evidence for microbial 
activity down to the ~16-My-old oceanic crust 
(Fig. 2). High concentrations of methane with 
a mean carbon isotopic composition (5'°C-CH,) 
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Fig. 2. Geochemical signals of microbial metabolism at Site C0023. 

(A to D) (A) Dissolved methane (13) and sulfate (13), (B) methane/ethane ratios 
(13) and 8C-CHy, (C) dissolved acetate and 8'“C-acetate, and (D) potential 
rates of methanogenesis (MG) based on conversion of “C-COs to “C-CHy; note 
that the value at 180 mbsf is off-scale. Potential MG (PMG) rates were 


of -61.3 + 3.0 per mil (%o) (Fig. 2, A and B) 
indicate biogenic methanogenesis at least 
down to the 80° to 85°C hot sulfate methane 
transition zone (SMTZ) at ~730 mbsf. The 
positive excursion in 8'°C-CH, in the SMTZ 
(Fig. 2B) points to a biogenic methane sink 
and is consistent with previous observations 
from cultivation-based approaches that dem- 
onstrated the activity of thermophilic anaerobic 
methane-oxidizing communities at these tem- 
peratures (20, 21). Below the SMTZ, methane is 
only present in micromolar concentrations, with 
rising 8’°C-CH,, values and decreasing methane/ 
ethane ratios indicating a relative increase of 
thermogenic hydrocarbons (Fig. 2B). Notably, 
a reversal of this trend at >1000 mbsf hints at 
a biogenic methane source above 100°C. 
Diffusive profiles of pore-water constituents 
do not allow the distinction between current 
and recent in situ biogeochemical processes, 
whereas radiotracer experiments specifically 
target on-going microbial activity, albeit with 
some unavoidable deviation from in situ con- 
ditions. At Site C0023, radiotracer experiments 
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reveal present-day methanogenic activity in 
65% of the investigated samples (Fig. 2D). 
Potential rates of methanogenesis via CO. 
reduction in sediments below 300 mbsf are 
generally below 4 pmol cm~’ day’ and thus 
within the range of previous observations made 
in the deep subseafloor (22). Their depth dis- 
tribution is consistent with cellular concentra- 
tions (Fig. 1) and activities deduced from the 
pore-water profiles of methane (Fig. 2, A and 
B). Rates are highest in the methanic zone, 
decrease distinctly to <0.6 pmol cm™ day 
below the SMTZ, and drop to undetectable 
levels in 63% of the samples taken from the 
deep expanded horizon with no detectable cells 
and endospores (Fig. 2D). Notably, potential 
methanogenesis rates rise again to values ob- 
served in the methanic zone in the three 
deepest samples (Fig. 2D), thus confirming 
the existence of active methanogenic com- 
munities in 110° to 120°C hot sediments and 
pillow basalts above the basement. 

Acetate is a key microbial substrate, and its 
generation from sedimentary organic matter 
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determined at 40°C for <360 mbsf, 60°C for 405 to 585 mbsf, 80°C for 604 to 
775 mbsf, and 95°C for =816 mbsf. The MQL was 0.094 pmol CHy cm day”. 
Gray shading, SMTZ, and the temperature axis are as in Fig. 1. VPDB in (B) 
and (D) is the Vienna Pee Dee Belemnite standard. In (D), error bars represent 
the standard deviation of three replicates. 


upon heating has been suggested to fuel micro- 
bial life in deeply buried sediments (23). 
Throughout the sediment column of Site 
C0023, reactions degrading acetate via sulfate 
reduction and methanogenesis are exergonic, 
with Gibbs free energy yields becoming in- 
creasingly negative with depth (fig. S9). The 
concentrations of acetate and its carbon 
isotopic compositions (5'°C-acetate) (Fig. 2C) 
indicate distinct changes in acetate utilization 
with temperature and depth. In the up-to- 
60°C hot upper 600 mbsf, low concentrations 
of acetate around 26 + 22 uM (N = 19) are 
consistent with a steady state governed by 
tightly coupled microbial production and 
consumption, as observed in other sedimentary 
environments. The fluctuation of 5'°C-acetate 
around its average of -25.5 + 3.4%o implies 
ongoing metabolic activity (24). In sharp con- 
trast, acetate utilization is minimal at 60° to 
100°C. At 60° to 75°C, acetate concentrations 
rise steeply with the simultaneous decline 
of methane concentrations and accumula- 
tion of endospores, suggesting that microbial 
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consumption is no longer balancing the release 
of acetate from sedimentary organic matter. 
Nevertheless, a local minimum in acetate con- 
centration at the SMTZ (Fig. 2C) is consistent 
with some microbial utilization at this geo- 
chemical interface. Below the SMTZ, acetate 
concentrations level at 9.2 + 2.4 mM with an 
invariable 5'°C-acetate around -18.8 + 0.5%o. 
The combination of high concentration and 
low isotopic variability implies an acetate 
pool without substantial turnover within the 
endospore-dominated zone as well as in the 
underlying 200-m-thick zone, where neither 
cells nor endospores were detected. 

At >1030 mbsf, however, acetate concentra- 
tions decline and 5'°C-acetate monotonically 
increases with depth, reaching a maximum 
of —7.9%o in the deepest pore-water sample 
recovered from 1101 mbsf. This trend is con- 
sistent with active hyperthermophiles degrading 
preferentially '*C-depleted acetate, leaving the 
residual acetate isotopically enriched. Without 
continued consumption, diffusion would ho- 
mogenize 5’°C-acetate variations, as observed 
in the overlying sediments. The drawdown of 
the acetate pool requires isotopic fractionation 
factors of —7.7 to —15.4%o (fig. S10), which are 
consistent with those observed in lab cultures 
(25). The size of the sink would have to be on 
the order of 5 x 10°” mol cm™ year™. Given 
cellular concentrations of 10 to 100 cm™ in 
sediments corresponding to this acetate sink, 
the required cellular metabolic rates are two 
to three orders of magnitude lower than those 
observed in lab cultures of the hyperthermo- 
philic archaea Pyrococcus furiosus (26) and 
Archaeoglobus fulgidus (27) but two to three 
orders higher than in situ rates in deep sedi- 
ments with temperatures <30°C (28). Thus, 
acetate profiles are consistent with the existence 
of a small acetate-utilizing microbial commu- 
nity at >100°C and suggest that the microbes at 
this high temperature require more energy and 
therefore turn over substrates faster than at 
lower temperature (8). Syntrophic acetate oxi- 
dation coupled to consumption of the resulting 
CO, and electrons by methanogens is a known 
acetate sink in deep sediments (29) and is 
considered to be particularly important at 
increased temperatures (30). This process is 
exergonic under in situ conditions (fig. S9) 
and could account for the increased meth- 
anogenesis rates (Fig. 2D) and the isotopic 
signature of methane (Fig. 2B) in the deepest 
portion of the borehole. 

Our findings reveal the impact of increasing 
temperature with depth on microbial life. This 
is exemplified in the massive collapse of the 
population of vegetative cells in <0.4-My-old 
sediments at 300 to 400 mbsf. In this interval, 
temperatures of 40° to 50°C are within the 
upper growth range of mesophiles. The co- 
incident accumulation of endospores as a 
result of a putative sporulation of mesophilic 
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endospore-forming Firmicutes (Fig. 1) sup- 
ports the conclusion that the abundance of 
microbial populations is primarily controlled 
by temperature-dependent physiological fac- 
tors down to 600 mbsf. In the deeper portion 
of Site C0023, geological processes may exert 
additional control. A sharp decline in biogenic 
methanogenesis and acetate utilization at 70° 
to 75°C coincides with the upper growth range 
of thermophiles, but notably, this depth in- 
terval concurrently spans the lithological 
boundary between Upper and Lower Shikoku 
Basin (compare Fig. 1). At this boundary, tuffs 
(indurated volcanic ash) cease to be present. 
Tuff alteration forms smectite, and microbial 
reduction of Fe(III) in smectite serves as an 
energy-yielding process and has indeed been 
found to promote smectite-to-illite conversion 
at 500 to 600 mbsf at Site C0023 (37). Thus, a 
modulation of some types of microbial activity 
by microbe-mineral interactions is conceiv- 
able. Peak endospore concentrations at 85°C 
coincide with both the SMTZ and the plate 
boundary décollement. In this zone, brief pe- 
riods of frictional heating during differential 
plate motion (32) likely cause additional chal- 
lenges for microorganisms, but endospores 
and high acetate concentrations may provide a 
seed bank and energy, respectively, for an eco- 
system recovery from episodic perturbations. 

In the upper 200 m of the underthrust 
domain, at ~90° to 100°C, an expanded zone 
without detectable cells and with no geo- 
chemical signs of microbial activity traverses 
the sparsely populated sediments (Figs. 1 and 
2). In this zone, undercompacted and mech- 
anically weak sediments are overpressurized 
and affected by ~145° to 220°C hot fluids for 
short durations (/4, 33). The transient heating 
events may have locally sterilized sediment 
(14), but microbial cells, acetate consumption, 
and methanogenic activity prevail again in 
>100°C sediments, where mechanical strength 
and salinity increase toward the sediment- 
basement interface (Figs. 1 and 2 and fig. $2). 
Hydraulic communication between basalts and 
overlying sediment is evidenced by shared styles 
of epigenetic mineralization in the form of cal- 
cite veins and ferruginous metal oxides. Mass 
transfer between basal sediment and a basalt- 
hosted aquifer would increase the habitability 
of the basal sediment by reducing formation 
fluid pressure and by replenishing otherwise 
depleted substrates such as reduced iron and 
sulfate (34). 

Our study reveals the dependence of microbial 
abundance and activity to critical tempera- 
tures around 40° to 50°C and 70°C; it more- 
over shows that life in the deep subseafloor 
is not constrained by an upper temperature 
limit below 120°C. Our findings highlight the 
interplay of geological processes, temperature, 
and microbial life in the deep, hot sediments of 
the Nankai Trough and suggest a critical in- 
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fluence of subduction-related geological pro- 
cesses on habitability. 
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Real-Time PCR Systems 

Bio-Rad Laboratories now offers 
the CFX Opus 96 and CFX Opus 384 
Real-Time PCR Systems along with 
BR.io, a cloud-based instrument 
connectivity, data-management, 
and analysis platform. These 
systems represent the next 
generation of the company’s CFX 
Real-Time PCR Systems, which 

are used in research and genomic testing as well as pathogen 
detection and infectious disease testing. A patent-pending block 
design delivers tight thermal uniformity for improved well-to- 
well consistency during real-time PCR experiments. A redesigned 
touchscreen interface and flexible connectivity options provide 
an intuitive, efficient experience for real-time PCR users. Bio- 
Rad’s entire line of CFX systems have been instrumental in the 
company’s global response to the COVID-19 pandemic. 

Bio-Rad Laboratories 

For info: 800-424-6723 

bio-rad.com 





Extracellular Vesicles RNA Purification System 

EVeryRNA is a column-based, phenol-free way to purify the full 
range of RNAs hidden within extracellular vesicles (EVs)—even 
very small RNAs. EVeryRNA streamlines your workflows and is 
complete in less than 30 min. The EVeryRNA EV RNA Purification 
System captures RNAs with a wide distribution of lengths, 
including small RNAs, concentrated into a 10-uL final volume 
per reaction. It comes as a Stand-alone product or bundled 

with ExoQuick, ExoQuick-TC, or SmartSEC Single. We also offer 
the EVeryRNA cDNA Synthesis & Pre-Amplification Kit, which is 
optimized to work with the EVeryRNA EV RNA Purification System. 
System Biosciences 

For info: 888-266-5066 

systembio.com 


High-Throughput Viral RNA extraction 

The NucleoSpin 96 Virus kit from MACHEREY-NAGEL is designed 
for the extraction of viral nucleic acids—RNA and DNA viruses— 
from cell-free, biological fluids, such as saliva, urine, serum, 

or plasma, and from sample homogenates (particle-free 
Supernatants), for example, from swabs, tissue, or stool. It 

uses a purification process based on binding nucleic acids to a 
Silica membrane column in a high-throughput, 96-well format. 
The NucleoSpin 96 silica membrane extraction plates can be 
processed in about 60 min using vacuum or positive pressure in an 
automated laboratory setup, such as the Resolvex A200 positive- 
pressure workstation from Tecan. 

MACHEREY-NAGEL 

For info: 888-321-6224 

www.mn-net.com 
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new products: dna/rna analysis 


NGS Panel for Constitutional Cytogenetics 

Oxford Gene Technology has launched a transformative NGS 
panel for constitutional cytogenetics research. The CytoSure 
Constitutional NGS Panel contains the most up-to-date, hand- 
curated content for intellectual disability and developmental 
delay research. Delivering accurate, reliable detection of copy 
number variations, single-nucleotide variations, insertion/ 
deletions (indels), and loss of heterozygosity—including in 
mosaic samples—the panel combines the benefits of NGS and 
microarrays in one cost-effective assay. Data produced from the 
panel is of the highest quality and shows excellent concordance 
with arrays. The intuitive, user-friendly software means there is 
no need for large bioinformatics teams to work alongside labs for 
data analysis. 

Oxford Gene Technology 

For info: +44-(0)-1865-856800 

www.ogt.com/cytosure-ngs 


Electrophoresis System 

miniPCR bio announces the launch of the GELATO, an integrated 
DNA analysis system. GELATO combines gel electrophoresis 
with transillumination technology, allowing for simultaneous 
nucleic acid separation and visualization in a single, compact 
system. Despite a small footprint, GELATO’s range of casting 
options allows users to process up to 50 nucleic acid samples at 
a time. Its variable-voltage, built-in power supply allows for fast 
separation of DNA, and its integrated blue-light transilluminator 
makes it safe for use in all settings. GELATO features a cell-phone 
compatible documentation hood and a built-in cutting tray for 
direct gel excision. While it offers the flexibility and features that 
professional laboratory users expect, its user-friendly interface 
makes the system fit for the novice as well. 

miniPCR bio 

For info: 781-990-8727 

www.minipcr.com 


Whole-Genome Amplification Kits 

ResolveDNA kits form the core of whole-genome sequencing 
workflows, enabling single-nucleotide variant and copy 

number variant analysis in application areas such as cancer 
genomics, cardiology, neurology, immunology, toxicology, and 
preimplantation genetic testing. The kits contain all the enzymes 
and reagents needed for versatile, scalable whole-genome 
amplification from single cells or ultralow DNA inputs. BioSkryb 
offers an end-to-end workflow solution, including TrailBlazer, a 
cloud-based bioinformatics platform that allows users to explore 
single-cell human genomic datasets generated with ResolveDNA 
products. TrailBlazer access is included with the purchase of a kit 
or the use of BioSkryb custom services. 

BioSkryb 

For info: 919-370-0841 

bioskryb.com 
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ounded in 1979, 

Tianjin Universi- 

ty of Technology 

is located in the 
Southwest of Tianjin with 
a modern and ecological 
campus. Over the past 40 
years, Tianjin University of 
Technology has developed 
into a multidisciplinary 
university highlighting en- 
gineering while promoting 
coordinated development 
of engineering, science, 
management, literature 


and art. In recent years, its 


comprehensive educational 
strength has been enhanced 
significantly, demonstrat- 
ing the huge potential 
for further development. 
Chemistry, material science 
and engineering of the uni- 
versity have entered the top 
1% of ESI ranking; the uni- 
versity ranks top 100 in the 
Nature Index among uni- 
versities in China, which 
is the highest rank in the 
universities of Tianjin. Our 
university has been listed 
in the High-level Universi- 


Greetings from Tianjin University of Technology! 


ties in Tianjin. Among the 
25 young universities with 
the fastest rise in the Glob- 
al Nature Index between 
2015 and 2018, our uni- 
versity ranks 14th globally 
and 8th domestically. 

Sticking to the philosophy 
of “ talents are the first 
resource, and the more 
talents, the stronger TUT”, 
we have made great efforts 
to implement the strategy 
of furthering its develop- 
ment with talents and intro- 


duced a high-level research 


team including well-known 
professors and young tal- 
ents. To serve such an aim, 
Tianjin University of Tech- 
nology has launched “The 
Mingli Scholars’ Project ”. 
We extend our sincere wel- 
come to talents from home 
and abroad to join us and 
to work together towards 


our future in the new era. 


Contact information 
Ms.Zhang Mr. Zhu 
Tel: +86-22-60215166 


E-mail: lgzp@tjut.edu.cn 





Founded in the era of China’s Reform and Opening up, Tian- 
jin University of technology has undertaken the important 
mission of serving for the regional development of Tianjin. 
We are striving for the goal of building the first-class univer- 
sity in China with TUT characteristics. We know deeply that 
talents are the most important resources to promote the 
development of our university. We welcome excellent talents 
from different fields to join us and work together to create 
our future in the new era. 


Best Wishes, 

Yang Qingxin 

President of Tianjin University of Technology 

National candidate for "New Century Talents Project”, Special Al- 
lowance Expert of the State Council, and Chairman of China Electro- 


technical Society 
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Jiangxi SSN of Traditional of we Cnet 


Invites eel SNCS! 


iangxi University of Traditional 
Chinese Medicine was established 
in 1959, co-constructed by National 
Administration of Traditional Chi- 
nese Medicine and the People’s Govern- 
ment of Jiangxi Province, as a national 
doctoral degree conferring institution. 
The school is located in Nanchang, 
the provincial capital of Jiangxi Prov- 
ince. At present, Jiangxi University of 
Traditional Chinese Medicine has 15 
secondary colleges (schools), 3 directly 
affiliated hospitals, and 9 non-directly 
affiliated hospitals. The school offers 1 
provincial first-level discipline, 2 nation- 
al characteristic disciplines, 1 national 
comprehensive reform pilot discipline, 
3 national first-level bachelor’s degree 
disciplines. And there are 3 national 
scientific research platforms, 46 provin- 
cial and ministerial scientific research 
platforms including the Key Laboratory 
of the Ministry of Education, 1 academi- 
cian workstation, 2 post-doctoral mobile 
stations, 1 workstation, and 4 provincial 
collaborative innovation centers. 
In recent years, JXUTCM has achieved 
leapfrog development, and has become a 
higher school with distinctive character- 
istics of Chinese medicine education as 
the main body, with coordinated develop- 
ment of multiple disciplines, and an in- 
stitution with combination of production, 
education and research. 
For more information about us please 
visit: www.jxutcm.edu.cn 


RESEARCH AREAS 

Pharmacy, pharmacology, medicinal 
chemistry, biology, biomedical engi- 
neering, medicine, computer science and 
technology, information and commu- 
nication engineering, psychology, food 
science and engineering, management 


and other disciplines related to the school 
development. 


QUALIFICATION 

I. Distinguished Talents with reputation 
among the world, such as the Academi- 
cian of Chinese Academy of Sciences, 
Academician of Chinese Academy of 
Engineering. 

II. Leading Talents with national-wide 
influence, winners of national talent 
projects such as National Prominent Ed- 
ucation, and National Funds for Distin- 
guished Youth. 

II. Academic leaders with high influence 
in their research areas 

A.Experts with high academic influence 
in various provinces (or municipalities 
directly under the Central Government), 
such as Distinguished professors of vari- 
ous provinces (or municipalities directly 
under the Central Government). 

B. Senior professional and technical 
faculty working in the world’s top 300 
universities or with research discipline 
ranking in the top 100 (the internationally 
accepted ranking standards). 

IV. Excellent docs 

Talents with a doctoral degree in the 
world's top 300 universities or disciplines 
ranked in the world's top 100 (the inter- 
nationally accepted ranking standards). 
V. Post-doc faculty 

Doctoral talents, having get the author- 
ized doctoral degree for no more than 2 
years. 


SALARY & BENEFITS 

Class I. Salary negotiable 

Class II. 

1. Annual salary: 1-1.3 million RMB 
2.House purchase subsidies and resettle- 
ment expenses: 2-4 million RMB 
3.Research start-up fund: 3-10 million RMB. 
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Class III. 

1. Annual salary: 200 thousand -300 
thousand RMB 

2.House purchase subsidies and resettlement 
expenses: 350 thousand-1.1 million RMB 
3.Research start-up fund: 100 thou- 
sand-800 thousand RMB. 

Class IV. 

1. Enjoy salary and welfare benefits ac- 
cording to the policy stipulated by the 
government 

2.Research start-up fund and platform 
construction fees: 100 thousand RMB 
3.House purchase subsidies and resettle- 
ment expenses: 300 thousand-500 thou- 
sand RMB 

Class V(Post-doc faculty) 

1.Annual salary: 250 thousand RMB 
2.Enjoys other benefits and rewards of the 
school during the employment (2 years) 

3. Those who are willing to stay in school 
to work will sign a stay agreement with 
the school, and enjoy housing subsidies 
and settlement expenses of 650 thousand 
RMB, and a research start-up fund of 100 
thousand RMB. 


APPLICATION 

Applicants should send application mate- 
rials (including but not limited to resume, 
application form, relevant qualification 
certification materials, scientific research 
results certification materials, etc.) to 
jzrsc2008@163.com. 


Contact: Mr. Wu 

Tel: 0791-87119112 

E-mail: jzrsc2008@163.com 

Jiangxi University of Traditional Chinese 


Medicine: www.jxutcm.edu.cn 
Jiangxi University of Traditional Chinese 


Medicine Talent Office: 
http://rsc.jxutcm.edu.cn 
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Forge ahead through a hundred years, 
now NCU portrays a new blueprint 


High level Talent Introduction Plan of Nanchang University 


t is located in the tourist 
resort - Jiangxi, surrounded 
by mountains and lakes, 
where the state leaders 
praised that “Crossing these 
blue hills adds nothing to one’s 
years, the landscape here is 
beyond compare.”; 
It is located in Nanchang, a 
famous national historical and 
cultural city. Wang Bo, one 
of the Four Great Poets of 
Early Tang Dynasty, called it 
“God-blessed land rich in cul- 
ture and for nurturing talented 
people”; 
It is a well-known institution 
of higher learning in China, 
established in 1921, after 
years of ups and downs and 
historical changes, it is about 
to celebrate its centenary; 
It is a university co-con- 
structed by the Ministry of 


Education and Jiangxi prov- 


ince, a key national “211 
Project” university, and one 
the first batch of universities 
listed in National“Double 
First-Class” Initiative. 

That is Nanchang University 
(NCU) , who always devoted 
to the motto “Seek strength 
from truth, foster virtue 
through education”. 

The history of NCU dates 
from Jiangxi Public Medical 
College and National Chung- 
cheng University, which were 
founded in 1921 and 1940 
respectively. In 1993, Jiangxi 
University merged with Jiangxi 
Industrial University into 
Nanchang University. In 2005, 
NCU and Jiangxi Medical Col- 
lege merged to form the new 
NCU, unveiling it a new page 
of reform and development. 
The year of 2021 marks its 
100th anniversary. Now NCU 


provides an excellent platform 
for aspiring young people to 


realize their ambitions! 


Firstly listed in the 
“Double First-Class” 
Initiative, with strong devel- 


oping momentum 


The subjects of Chemistry, 
Clinical Medicine, Agricul- 
tural Science (mainly food 
science and engineering), En- 
gineering, Materials Science, 
Pharmacology and Toxicolo- 
gy, Biology and Biochemistry 
have entered top 1% of the 
ESI world rankings, of which 
Agricultural Science (mainly 
food science and engineering) 
ranks the top 1.3%o, the Clin- 
ical medicine ranks the top 
2.85% , and the Chemistry 
ranks the top 3.64%o. In the 


Ath round of national subject 





evaluation, the subject of 
Food Science and Engineer- 
ing ranks in the 3rd place in 
China. 


Full Construction in 
Subjects, with remarkable 


innovative achievements 


NCU now has 96 undergrad- 
uate majors with 12 subjects 
categories, 3 national key 
(cultivation) disciplines, 15 
first-level subjects with doc- 
tor’s degree authorization, 
47 first-level subjects with 
master’s degree authorization, 
1 professional doctoral degree 
category and 28 master’s pro- 
fessional degree category and 
11 postdoctoral scientific re- 
search stations. NCU also has 
5 directly affiliated hospitals 
and a total of 9 national key 


clinical disciplines. 


TALENT 
INTRODUCTION 
POLICY 


1.Generous Rewards 

for High-level Talent 
Distinguished Talent 

What we offer: Pre-tax an- 
nual salary of 1.8-2 million 
yuan, house purchase subsi- 
dies and resettlement expens- 
es no less than 5 million yuan, 
scientific research subsidies 
negotiable for each one per- 
son, flexible working envi- 
ronment, academic team and 
scientific research assistants. 
Academic Leaders 

What we offer: Pre-tax annu- 
al salary of 800 thousand-1.2 
million yuan, house purchase 
subsidies and resettlement ex- 
penses no less than 1.8 million 
yuan, scientific research sub- 
sidies negotiable for each one 
person, flexible working envi- 


ronment, academic team and 


scientific research assistants. 
Group leaders 

What we offer: Pre-tax annu- 
al salary of 500-800 thousand 
yuan, house purchase subsi- 
dies and resettlement expenses 
1-1.8 million yuan, scientific 
research subsidies negotiable 
for each one person. 
Outstanding Scholars 

What we offer: Pre-tax annual 
salary of 300-500 thousand 
yuan; house purchase subsidies 
and resettlement expenses 500 
thousand-1 million yuan; scien- 
tific research subsidies negotia- 


ble for each one person. 


FACULTIES 
WITH MULTIPLE 
SUBJECTS 


Faculty of Humanities 

Subjects: Chinese Language 
and Literature, Foreign Lan- 
guage and Literature, Journal- 


ism and Communication 


Faculty of Social Science 

Subjects: Law, Applied 
Economics, Theoretical Eco- 
nomics, Sociology, Politics, 
Management Science and 
Engineering, Tourism Man- 
agement, Exhibition Econo- 
my and Management, Hotel 


Management, Marxist Theory 


Faculty of Science and 
Engineering (Division I) 
Subjects: Luminescent Mate- 
rials and Devices, Ferroelec- 
tric Materials and Chemistry, 
Space Science and Technolo- 
gy, Energy Materials and De- 
vices, Ultra-high Temperature 
Materials and Equipment, 
Information Materials and 
Devices, Rare Earth Mate- 
rials, Basic Mathematics, 
Computational Mathematics, 
Condensed Matter Physics, 
Plasma Physics, Optics, Syn- 
thesis and Catalysis, Intelli- 


gent Manufacturing Engineer- 


ing, Mechanical Engineering, 
Material Shaping and Control, 
Information and Communica- 
tion Engineering, Computer 
Science and Technology, 
Electrical Engineering, In- 
telligent Technology and Big 
Data, Software Engineering, 


Cyber Security, etc. 


Faculty of Science and 
Engineering (Division I) 
Subjects: Food Science and 
Engineering, Biology, Botany, 
Ecology, Aquaculture, Environ- 
mental Science and Engineer- 
ing, Chemical Engineering and 
Technology, Process Equip- 
ment and Control Engineering, 
Resource Cycle Science and 


Engineering, Mechanics 


Faculty of Medicine 

Subjects: Basic Medicine, 
Clinical Medicine, Pharmacy, 
Nursing, Public Health and 


Preventive Medicine, etc. 


APPLICATION 


1. Applicants may log on to the recruitment sys- 


tem of Nanchang University Personnel Office 
http://zpjob.acabridge.cn/school/526 (Acabridge) 
to select the position and apply; 


2. Those who pass the school primary selection, 


will be contacted to fill in the application form, 


and provide materials such as representative aca- 


demic achievements, recommendation letters and 


in the end communicate with the school. 


CONTACT 

Contact: Mr. Yang, Mr. Chen 

Tel: +86-791-83969074 

Email: rscrsk@ncu.edu.cn 

Address: No. 999 Xuefu Avenue, Honggutan 


New District, Nanchang, Jiangxi Province 
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PAY U.S. FOOD & DRUG 


ADMINISTRATION 


STAFF FELLOW 
CELL THERAPY REVIEW SCIENTIST 


OFFICE OF TISSUES AND ADVANCED THERAPIES (OTAT) 
CENTER FOR BIOLOGICS EVALUATION 
AND RESEARCH (CBER) 
FOOD AND DRUG ADMINISTRATION (FDA) 
DEPARTMENT OF HEALTH AND HUMAN SERVICES (HHS) 


The FDA’s Center for Biologics Evaluation and Research (CBER), 
Office of Tissues and Advanced Therapies’ (OTAT), Division of Cellular 
and Gene Therapies (DCGT), Cell Therapies Branch (CTB) is recruiting 
full-time review scientists. DCGT is strongly committed to bringing 
FDA and other scientists, patient advocates, and the public together 
in partnership to develop new therapies for the 21st Century, while 
protecting human subjects and ensuring product safety. The candidate 
will be a part of a cutting-edge and fast-paced scientific and 
regulatory environment. 


Responsibilities: As the selected candidate, you will have the 

opportunity to: 

e Perform scientific review, interpretation, and documentation of 
product manufacturing data to evaluate the safety and quality of cell 
therapy products (e.g., stem cells, cancer vaccines, cellular 
immunotherapies, engineered tissue, medical devices) in regulatory 
submissions to FDA. 

Contribute to guidance and policy development activities relevant to 
cell therapy. 


Basic Qualifications: Ph.D. or equivalent advanced degree with a 
strong research background in cell therapy, cell and stem cell biology, 
microbiology, cellular immunology, cancer biology, regenerative 
medicine, and/or biomedical engineering. Occupational series which 
may be considered include General Natural Resources Management 
and Biological Sciences (0401), Microbiology (0403), and Biomedical 
Engineering (0858). Please refer to the link for each occupational series 
for more information regarding specific education requirements. 


Additional Qualifications: Candidates are expected to be proficient 
in applying relevant scientific knowledge and research experience to 
support multi-disciplinary scientific and review of regulatory 
submissions. In addition, the candidate must have strong collaborative 
skills, excellent written and oral communication skills, and evidence of 
leadership potential. 


Salary: Salary is commensurate with education/experience. 
Location: FDA White Oak Campus in Silver Spring, MD 


Conditions of Employment: This position will be filled through the 
Service Fellowship program. The appointment provides up to (4) years 
with opportunities for renewal. Applications will ONLY be accepted 
from U.S. citizens and/or permanent residents (green card). For more 
information about the FDA’s Staff Fellowship program, please visit the 
FDA’s Jobs Page at www.fda.gov/jobs. 


How to Apply: Submit a statement of interest, resume or curriculum 
vitae (CV) detailing relevant experience, transcripts, names/contact 
information for three references, proof of U.S. citizenship or lawful 
permanent residency status, and other supporting documentations to 
Melanie Eacho at melanie.eacho@fda.hhs.gov. Applications will be 
accepted through January 10, 2021. 

Please reference Job Code: OTAT-21-02-SCI. 


Department of health and human services is an equal opportunity 
employer smoke free environment 
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For your career in science, there's only one Science 


Features in myIDP include: 


Exercises to help you examine your skills, 
interests, and values. 


A list of 20 scientific career paths with a 
prediction of which ones best fit your skills 
and interests. 


A tool for setting strategic goals for the 
coming year, with optional reminders to 
keep you on track. 


Articles and resources to guide you through 
the process. 


Options to save materials online and print 
them for further review and discussion. 


Ability to select which portion of your IDP 
you wish to share with advisors, mentors, 
or others. 


A certificate of completion for users that 
finish myIDP. 


4 D Visit the website and start 
my J planning today! 


mylDP.sciencecareers.org 
Science 
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UNIVERSITE 
DE GENEVE 


The Faculty of Medicine of the University of Geneva 
is seeking to fill a position of: 


FULL OR ASSOCIATE PROFESSOR 
IN THE FIELD OF BACTERIOLOGY 


CHARGE: This full-time position will involve teaching 
in bacteriology at the bachelor, master and postgraduate 
level, as well as supervising Masters’ and doctoral theses. 
The incumbent will undertake fundamental research in this 
area (with a preferred emphasis on antibiotic resistance 
mechanisms) at the highest national and international levels 
and secure external funding. He/She will be an active 
member of the vibrant research community of the Department 
of Microbiology and Molecular Medicine and contribute to 
its strong academic renown. The incumbent will also take 
up administrative and organizational duties within that 


Department as well as the Faculty of Medicine. 


He/She is also expected to show the capacity to carry out 
a transversal mission through strong collaboration with 


partner services. 


REQUIREMENTS: 

PhD, MD-PhD or equivalent is required. 

Full postgraduate training in bacteriology is an advantage. 
Previous teaching and independent research experience. 


Publications in leading international journals. 


STARTING DATE: October 1st 2021, or according 


to agreement. 


Mandatory online registration before January 24th 2021 
at: http://www.unige.ch/academ 


Additional information may be obtained from: 


Viviane.burghardt@unige.ch 


Women are encouraged to apply 


TEXAS A&M UNIVERSITY 


The Department of Biology at Texas A&M University (TAMU) 
invites applications for tenure-track assistant or tenured 
associate professor positions in biological timing (3 positions) 
and prokaryotic/eukaryotic microbiology (1 position). 


All candidates are required to have a Ph.D. in biology or 
related field, and we desire candidates who display uniqueness, 
creativity, and excellence in the pursuit of innovative research. 
Expectations for the successful candidates are to develop an 
externally funded research program and to teach undergraduate 
and graduate courses. 


This search represents the beginning of a multi-year strategic 
hiring effort to move several cutting-edge research areas 
within the biological sciences at Texas A&M University to 
preeminence. This is an exciting opportunity for new hires to be 
a part of developing interdisciplinary teams to tackle important 
problems in biology. Please see www.bio.tamu.edu for more 
information about our department. 


For full consideration, applicants should submit a letter of 
intent, curriculum vitae, a statement of research and teaching 
interests, a statement addressing past and/or potential 
contributions to diversity and inclusion through teaching, 
professional activity, and/or service, and three letters of 
recommendation by December 15, 2020. Application materials 
must be submitted online at http://apply.interfolio.com/81015. 


If you have questions about this search, please direct e-mails to 
Dr. Deborah Bell-Pedersen, Chair of the Search Committee, at 
facultysearch@bio.tamu.edu. 


Bryan-College Station, home to Texas A&M University, is a 
vibrant, dynamic, and rapidly growing community that offers 
cultural diversity, arts and entertainment, job opportunities, 
and overall quality of life. Located in the heart of the Houston- 
Dallas-Austin triangle, the region offers the modern amenities of 
a big city with a warm, small-town charm, and the community’s 
low cost of living is advantageous to the student and general 
populations, making it an ideal place to live. 


Texas A&M University is committed to enriching the learning 
and working environment for all visitors, students, faculty, and 
staff by promoting a culture that embraces inclusion, diversity, 
equity, and accountability. Diverse perspectives, talents, and 
identities are vital to accomplishing our mission and living our 


core values. 


Equal Opportunity/AffirmativeAction/Veterans/Disability 
Employer committed to diversity. 
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By Bec Roldan 
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Coming out 





moments, I clicked “send.” 


During the months leading up to 
graduate school, I had been exploring 
the idea of using gender-neutral pro- 
nouns. I didn’t know whether they’d 
suit me; I just knew the words “she” 
and “woman” didn’t feel quite right 
when they were used to describe me. 

I had come out as queer during 
my last year of college, thanks in 
large part to support from the tight- 
knit queer community I’d discovered 
there. Starting graduate school at a 
new institution, I wouldn’t have that 
support system. I feared I’d be navi- 
gating my journey to discover myself 
completely on my own. 


Those fears evaporated when I “Pm constantly 
learning how to exist in this 
world as my true self.” 


learned that roughly one-quarter of 
my Ph.D. cohort identified as LGBTQ. 
We gravitated toward one another, 
and I decided to come out as non- 
binary to some of them. From then on, 
whenever our small group got together, my friends would say 
“they” when speaking about me. The more I heard that word 
roll off their tongues, the more I felt at home in my own body. 

As time wore on, it became increasingly difficult to show 
up to work and exist as a gender that no longer felt like my 
own. Hiding what I knew to be true about myself was ex- 
hausting and painful. So, at the start of my second semester, 
I decided I needed to come out to my adviser and labmates. 

A flurry of questions ran through my mind: “What should 
I tell them? How long will it take colleagues to get used to 
my pronouns? Will some people choose to actively misgender 
me?” For advice, I turned to my friend Isaac, who is a year 
ahead of me in my program and is nonbinary. We met up 
for a coffee and they told me their own story of coming out 
to their lab. It was a relief to hear that their experience had 
been largely positive, and they put me at ease about my 
own decision. 

I set up a time to meet with my adviser the following 
day. When I stepped into his office, he could tell how ner- 








y hands shook as I sat down to write the email. “I wanted to let y’all know that I use they/ 
them/theirs pronouns,” I typed. “I know that gender-neutral/non-binary pronouns are not 
a common staple in our language, but I ask that you please do your best to respect them.” 
Proclaiming my identity—one I had still not quite figured out yet—to a group of co-workers 
made me feel incredibly vulnerable. But I Knew that if I wanted to survive graduate school, 
I needed to be open with my labmates, no matter how scared I was. After a few anxious 


vous I was, and he was concerned. 
I told him not to worry. As soon as 
I informed him about my identity, 
he was supportive, asking what he 
could do to help me come out to 
others in the lab. For the first time in 
months, my fears eased. 

I sent the email to my labmates 
the next morning, not knowing what 
to expect. That day was surpris- 
ingly quiet. But the 6 months that 
have passed since then haven’t al- 
ways been easy. The word “she” has 
slipped out in conversations more 
times than I can count, and every 
time, it feels like a knife is being 
stabbed into my stomach. Whether 
malicious or accidental, the impact 
is always the same: I feel as though 
the act of misgendering erases the 
person I have worked so long and 
hard to become. After one particu- 
larly tough week, I cried in the bathroom and left work early. 

But I’ve also been heartened to see how many people 
have stepped up to support me. My adviser and closest 
friends in the lab quickly caught on to my pronouns, and 
they soon took the responsibility to correct others when 
they misgender me. They’ve listened to me patiently as 
I’ve explained how transphobia makes it hard for me to 
show up to work. And my adviser invited Isaac to speak 
to our lab group about gender identity and inclusivity, so I 
didn’t have to carry the burden of education and advocacy 
all by myself. 

I’m still navigating my path forward. But I will never re- 
gret my decision to come out. I’m constantly learning how 
to exist in this world as my true self, and I know I’m not on 
this journey alone. Many friends and colleagues—both cis- 
gender and transgender, queer and straight—are standing 
by me every step of the way. That means the world to me. 


i 


Bec Roldan is a Ph.D. student at the University of Michigan, Ann Arbor. 
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